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INTRODUCTION 


athematics is the science of structure, order, 

and relation that has evolved from ele- 

mental practices of counting, measuring, 
and describing the shapes of objects. It deals with logical 
reasoning and quantitative calculation, and its develop- 
ment has involved an increasing degree of idealization 
and abstraction of its subject matter. Since the 17th cen- 
tury, mathematics has been an indispensable adjunct to 
the physical sciences and technology, and in more recent 
times it has assumed a similar role in the quantitative 
aspects of the life sciences. 

In many cultures—under the stimulus of the needs 
of practical pursuits, such as commerce and agricul- 
ture—mathematics has developed far beyond basic 
counting. This growth has been greatest in societies com- 
plex enough to sustain these activities and to provide lei- 
sure for contemplation and the opportunity to build on 
the achievements of earlier mathematicians. 

All mathematical systems (for example, Euclidean 
geometry) are combinations of sets of axioms and of the- 
orems that can be logically deduced from the axioms. 
Inquiries into the logical and philosophical basis of math- 
ematics reduce to questions of whether the axioms of a 
given system ensure its completeness and its consistency. 

As a consequence of the exponential growth of 
science, most mathematics has developed since the 15th 
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century CE. This does not mean, however, that earlier 
developments have been unimportant. Indeed, to under- 
stand the history of modern mathematics, it is necessary 
to know its history at least in Mesopotamia and Egypt, 
in ancient Greece, and in Islamic civilization from the 
9th to the 15th century. These civilizations influenced 
one another and Greek and Islamic civilization made 
important direct contributions to later developments. 
For example, India’s contributions to the development 
of contemporary mathematics were made through the 
considerable influence of Indian achievements on Islamic 
mathematics during its formative years. 

It is important to be aware of the character of the 
sources for the study of the history of mathematics. The 
history of Mesopotamian and Egyptian mathematics 
is based on the extant original documents written by 
scribes. Although in the case of Egypt these documents 
are few, they are all of a type and leave little doubt that 
Egyptian mathematics was, on the whole, elementary 
and profoundly practical in its orientation. For Meso- 
potamian mathematics, on the other hand, there are a 
large number of clay tablets, which reveal mathematical 
achievements of a much higher order than those of the 
Egyptians. The tablets indicate that the Mesopotamians 
had a great deal of remarkable mathematical knowledge, 
although they offer no evidence that this knowledge was 
organized into a deductive system. Future research may 
reveal more about the early development of mathematics 
in Mesopotamia or about its influence on Greek mathe- 
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matics, but it seems likely that this picture of Mesopota- 
mian mathematics will stand. 

From the period before Alexander the Great, no 
Greek mathematical documents have been preserved 
except for fragmentary paraphrases, and, even for the 
subsequent period, it is well to remember that the oldest 
copies of Euclid’s Elements are in Byzantine manuscripts 
dating from the 10th century CE. This stands in complete 
contrast to the situation described above for Egyptian 
and Babylonian documents. Although in general outline 
the present account of Greek mathematics is secure, in 
such important matters as the origin of the axiomatic 
method, the pre-Euclidean theory of ratios, and the dis- 
covery of the conic sections, historians have given com- 
peting accounts based on fragmentary texts, quotations 
of early writings culled from nonmathematical sources, 
and a considerable amount of conjecture. 

Many important treatises from the early period of 
Islamic mathematics have not survived or have survived 
only in Latin translations, so that there are still many 
unanswered questions about the relationship between 
early Islamic mathematics and the mathematics of Greece 
and India. In addition, the amount of surviving material 
from later centuries is so large in comparison with that 
which has been studied that it is not yet possible to offer 
any sure judgment of what later Islamic mathematics did 
not contain, and therefore it is not yet possible to eval- 
uate with any assurance what was original in European 
mathematics from the 11th to the 15th century. 


INTRODUCTION 


From ancient times mathematics was practiced 
at an extremely high level in South Asia (modern 
India, Pakistan, Nepal, Sri Lanka, Afghanistan, and 
Bangladesh) and East Asia (China, Japan, Korea, and 
Vietnam). During and after the 19th century, South and 
East Asian mathematics merged with the modern West- 
ern stream of mathematics. 

The mathematics of classical Indian civilization 
is an intriguing blend of the familiar and the strange. 
For the modern individual, Indian decimal place-value 
numerals may seem familiar—and, in fact, they are the 
ancestors of the modern decimal number system. Famil- 
iar too are many of the arithmetic and algebraic tech- 
niques involving Indian numerals. On the other hand, 
Indian mathematical treatises were written 1n verse form, 
and they generally do not share modern mathematics’ 
concern for rigorously structured formal proofs. Some 
historians of mathematics have deplored these aspects 
of the Indian tradition, seeing in them merely a habit 
of rote memorization and an inability to distinguish 
between true and false results. In fact, explanations and 
demonstrations were frequently added by later com- 
mentators, but these were sometimes described as “for 
the slow-witted.” For the traditional Indian teacher of 
mathematics, a demonstration was perhaps not so much 
a solid foundation for the student’s understanding as 
a crutch for the weak student’s lack of understanding. 
The Indian concept of ganita (Sanskrit: “computation”) 
was a form of knowledge whose mastery implied varied 
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talents: a good memory, swift and accurate mental arith- 
metic, enough logical power to understand rules without 
requiring minute explanations, and a sort of numerical 
intuition that aided in the construction of new methods 
and approximations. 

In modern times the invention of printing has largely 
solved the problem of obtaining secure texts and has 
allowed historians of mathematics to concentrate their 
editorial efforts on the correspondence or the unpub- 
lished works of mathematicians. However, the exponen- 
tial growth of mathematics means that, for the period 
from the 19th century on, historians are able to treat only 
the major figures in any detail. In addition, there is, as the 
period gets nearer the present, the problem of perspec- 
tive. Mathematics, like any other human activity, has its 
fashions, and the nearer one 1s to a given period, the more 
likely these fashions will look like the wave of the future. 
For this reason, this volume makes no attempt to assess 
the most recent developments in the subject. 


THE FOUNDATIONS OF 
MATHEMATICS 
So2F SoSH 


athematicians have long studied the logical 
M and philosophical basis of mathematics, 

including whether the axioms of a given sys- 
tem ensure its completeness and its consistency. Because 
mathematics has served as a model for rational inquiry 
in the West and is used extensively in the sciences, foun- 
dational studies have far-reaching consequences for the 
reliability and extensibility of rational thought itself. 

A remarkable amount of practical mathematics, 
some of it even fairly sophisticated, was already devel- 
oped as early as 2000 BCE by the agricultural civiliza- 
tions of Egypt and Mesopotamia and perhaps even far- 
ther east. However, the first to exhibit an interest in the 
foundations of mathematics were the ancient Greeks. 


ARITHMETIC OR GEOMETRY 


Early Greek philosophy was dominated by a dispute 
as to which is more basic, arithmetic or geometry, and 
thus whether mathematics should be concerned primar- 
ily with the (positive) integers or the (positive) reals, the 
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latter then being conceived as ratios of geometric quan- 
tities. (The Greeks confined themselves to positive num- 
bers, as negative numbers were introduced only much 
later in India by Brahmagupta.) Underlying this dispute 
was a perceived basic dichotomy, not confined to math- 
ematics but pervading all nature: is the universe made 
up of discrete atoms (as the philosopher Democritus 
believed) which hence can be counted, or does it con- 
sist of one or more continuous substances (as Thales of 
Miletus is reputed to have believed) and thus can only be 
measured? This dichotomy was presumably inspired by a 
linguistic distinction, analogous to that between English 
count nouns, such as “apple,” and mass nouns, such as 
“water.” As Aristotle later pointed out, in an effort to 
mediate between these divergent positions, water can be 
measured by counting cups. 

The Pythagorean school of mathematics, founded 
on the doctrines of the Greek philosopher Pythagoras, 
originally insisted that only natural and rational num- 
bers exist. Its members only reluctantly accepted the 
discovery that y2, the ratio of the diagonal of a square 
to its side, could not be expressed as the ratio of whole 
numbers. The remarkable proof of this fact has been 
preserved by Aristotle. 

The contradiction between rationals and reals was 
finally resolved by Eudoxus of Cnidus, a disciple of Plato, 
who pointed out that two ratios of geometric quantities 
are equal if and only if they partition the set of (positive) 
rationals in the same way, thus anticipating the German 


Mosaic floor depicting Plato (sitting, middle) teaching a 
group of disciples of the Academy of Athens 


mathematician Richard Dedekind (1831-1916), who 
defined real numbers as such partitions. 


Another dispute among pre-Socratic philosophers was more 
concerned with the physical world. Parmenides claimed that 
in the real world there is no such thing as change and that 
the flow of time 1s an illusion, a view with parallels in the 
Einstein-Minkowski four-dimensional space-time model 
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of the universe. Heracleitus, on the other hand, asserted 
that change is all-pervasive and is reputed to have said that 
one cannot step into the same river twice. 

Zeno of Elea, a follower of Parmenides, claimed 
that change is actually impossible and produced 
four paradoxes to show this. The most famous of these 
describes a race between Achilles and a tortoise. Since 
Achilles can run much faster than the tortoise, let us say 
twice as fast, the latter is allowed a head start of one 
mile. When Achilles has run one mile, the tortoise will 
have run half as far again—that is, half a mile. When 
Achilles has covered that additional half-mile, the tor- 
toise will have run a further quarter-mile. After n + 1 
stages, Achilles has run 


1+ Z tenet = a - 
5 Pi = pa 
miles and the tortoise has run 
1+ u ae Setar a + = 
y) tee a" ae 


miles, being still 1/2" *" miles ahead. So how can Achilles 
ever catch up with the tortoise? 

Zeno’s paradoxes may also be interpreted as show- 
ing that space and time are not made up of discrete 
atoms but are substances which are infinitely divisible. 
Mathematically speaking, his argument involves the sum 
of the infinite geometric progression 
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no finite partial sum of which adds up to 2. As Aristo- 
tle would later say, this progression is only potentially 
infinite. It is now understood that Zeno was trying to 
come to grips with the notion of limit, which was not 
formally explained until the 19th century, although 
a Start in that direction had been made by the French 
encyclopaedist Jean Le Rond d’Alembert (1717-83). 


total distance f! 1 


sh 
A 
Col— 


Zeno's paradox, illustrated by Achilles racing 
a tortoise 
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UNIVERSALS 


The Athenian philosopher Plato believed that mathemat- 
ical entities are not just human inventions but have a real 
existence. For instance, according to Plato, the number 
2 is an ideal object. This is sometimes called an “idea,” 
from the Greek eide, or “universal,” from the Latin uni- 
versalis, meaning “that which pertains to all.” But Plato 
did not have in mind a “mental image,” as “idea” is usu- 
ally used. The number 2 is to be distinguished from a 
collection of two stones or two apples or, for that matter, 
two platinum balls in Paris. 

What, then, are these Platonic ideas? Already in 
ancient Alexandria some people speculated that they are 
words. This is why the Greek word /ogos, originally meaning 
“word,” later acquired a theological meaning as denoting 
the ultimate reality behind the “thing.” An intense debate 
occurred in the Middle Ages over the ontological status of 
universals. Three dominant views prevailed: realism, from 
the Latin res (“thing”), which asserts that untversals have 
an extra-mental reality—that is, they exist independently 
of perception; conceptualism, which asserts that universals 
exist as entities within the mind but have no extra-mental 
existence; and nominalism, from the Latin nomen (“name’’), 
which asserts that universals exist neither in the mind nor in 
the extra-mental realm but are merely names that refer to 
collections of individual objects. 

It would seem that Plato believed in a notion of 
truth independent of the human mind. In the Meno Pla- 


THE FOUNDATIONS OF MATHEMATICS 


to’s teacher Socrates asserts that it is possible to come to 
know this truth by a process akin to memory retrieval. 
Thus, by clever questioning, Socrates managed to bring 
an uneducated person to “remember,” or rather to recon- 
struct, the proof of a mathematical theorem. 


THE Axiomatic METHOD 


Perhaps the most important contribution to the foun- 
dations of mathematics made by the ancient Greeks was 
the axiomatic method and the notion of proof. This was 
insisted upon in Plato’s Academy and reached its high point 
in Alexandria about 300 BCE with Euclid’s Elements. This 
notion survives today, except for some cosmetic changes. 

The idea is this: there are a number of basic math- 
ematical truths, called axioms or postulates, from which 
other true statements may be derived in a finite number 
of steps. It may take considerable ingenuity to discover a 
proof; but it is now held that it must be possible to check 
mechanically, step by step, whether a purported proof 
is indeed correct, and nowadays a computer should be 
able to do this. The mathematical statements that can be 
proved are called theorems, and it follows that, in prin- 
ciple, a mechanical device, such as a modern computer, 
can generate all theorems. 

Two questions about the axiomatic method were left 
unanswered by the ancients: are all mathematical truths 
axioms or theorems (this is referred to as completeness), 
and can it be determined mechanically whether a given 
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statement is a theorem (this is called decidability)? These 
questions were raised implicitly by David Hilbert (1862— 
1943) about 1900 and were resolved later in the negative, 
completeness by the Austrian-American logician Kurt 
Godel (1906-78) and decidability by the American logi- 
cian Alonzo Church (1903-95). 

Euclid’s work dealt with number theory and geom- 
etry, essentially all the mathematics then known. Since 
the middle of the 20th century a gradually changing 
group of mostly French mathematicians under the 
pseudonym Nicolas Bourbaki tried to emulate Euclid in 
writing a new Elements of Mathematics based on their 
theory of structures. Unfortunately, they just missed out 
on the new ideas from category theory. 


NuMBER SYSTEMS 


While the ancient Greeks were familiar with the positive 
integers, rationals, and reals, zero (used as an actual num- 
ber instead of denoting a missing number) and the nega- 
tive numbers were first used in India, as far as is known, 
by Brahmagupta in the 7th century CE. Complex num- 
bers were introduced by the Italian Renaissance math- 
ematician and physician Gerolamo Cardano (1501-76), 
not just to solve equations such as x + 1=0 but because 
they were needed to find real solutions of certain cubic 
equations with real coefficients. Much later, the German 
mathematician Carl Friedrich Gauss (1777-1855) proved 
the fundamental theorem of algebra, that all equations 
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with complex coefficients have complex solutions, thus 
removing the principal motivation for introducing new 
numbers. Still, the Irish mathematician Sir William 
Rowan Hamilton (1805-65) and the French mathema- 
tician Olinde Rodrigues (1794-1851) invented quaterni- 
ons in the mid-19th century, but these proved to be less 
popular in the scientific community until quite recently. 
Currently, a logical presentation of the number sys- 
tem, as taught at the university level, would be as follows: 


N-Z>-Q-R-C HH. 


Here the letters, introduced by Nicolas Bourbaki, 
refer to the natural numbers, integers, rationals, reals, 
complex numbers, and quaternions, respectively, and the 
arrows indicate inclusion of each number system into the 
next. However, as has been shown, the historical devel- 
opment proceeds differently: 


N 3Q' >R S>R-C-HH, 


where the plus sign indicates restriction to positive ele- 
ments. This is the development, up to R, which 1s often 
adhered to at the high-school level. 


CALCULUS REOPENS FOUNDATIONAL QUESTIONS 


Although mathematics flourished after the end of the 
Classical Greek period for 800 years in Alexandria and, 
after an interlude in India and the Islamic world, again in 
Renaissance Europe, philosophical questions concerning 
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the foundations of mathematics were not raised until the 
invention of calculus and then not by mathematicians 
but by the philosopher George Berkeley (1685-1753). 
Sir Isaac Newton in England and Gottfried Wilhelm 
Leibniz in Germany had independently developed the cal- 
culus on a basis of heuristic rules and methods markedly 
deficient in logical justification. As is the case in many 
new developments, utility outweighed rigour, and, though 
Newton’s fluxions (or derivatives) and Leibniz’s infinites- 
imals (or differentials) lacked a coherent rational expla- 
nation, their power in answering heretofore unanswerable 
questions was undeniable. Unlike Newton, who made little 
effort to explain and justify fluxions, Leibniz, as an emi- 
nent and highly regarded philosopher, was influential in 
propagating the idea of infinitesimals, which he described 
as infinitely small actual numbers—that 1s, less than 1/n in 
absolute value for each positive integer n and yet not 
equal to zero. Berkeley, concerned over the deterministic 
and atheistic implications of philosophical mechanism, 
set out to reveal contradictions in the calculus in his influ- 
ential book The Analyst; or, A Discourse Addressed to an 
Infidel Mathematician. There he scathingly wrote about 
these fluxions and infinitesimals, “They are neither finite 
quantities, nor quantities infinitely small, nor yet nothing. 
May we not call them the ghosts of departed quantities?” 
and further asked, “Whether mathematicians, who are so 
delicate in religious points, are strictly scrupulous in their 
own science? Whether they do not submit to authority, 
take things upon trust, and believe points inconceivable?” 
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Berkeley’s criticism was not fully met until the 19th 
century, when it was realized that, in the expression dy/ 
dx, dx and dy need not lead an independent existence. 
Rather, this expression could be defined as the limit of ordi- 
nary ratios Ay/Ax, as Ax approaches zero without ever being 
zero. Moreover, the notion of limit was then explained quite 
rigorously, in answer to such thinkers as Zeno and Berkeley. 

It was not until the middle of the 20th century that 
the logician Abraham Robinson (1918-74) showed that the 
notion of infinitesimal was in fact logically consistent and 
that, therefore, infinitesimals could be introduced as new 
kinds of numbers. This led to a novel way of presenting the 
calculus, called nonstandard analysis, which has, however, 
not become as widespread and influential as it might have. 

Robinson’s argument was this: if the assumptions 
behind the existence of an infinitesimal € led to a contra- 
diction, then this contradiction must already be obtain- 
able from a finite set of these assumptions, say from: 


O<B,E<1,B<o,---, Bee, 


But this finite set is consistent, as is seen by taking 
&=1/(n+ 1). 
Non-EuctiDEAN GEOMETRIES 


When Euclid presented his axiomatic treatment of geom- 
etry, one of his assumptions, his fifth postulate, appeared 
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to be less obvious or fundamental than the others. As it 
is now conventionally formulated, it asserts that there is 
exactly one parallel to a given line through a given point. 
Attempts to derive this from Euclid’s other axioms did 
not succeed, and, at the beginning of the 19th century, it 
was realized that Euclid’s fifth postulate is, in fact, inde- 
pendent of the others. It was then seen that Euclid had 
described not the one true geometry but only one of a 
number of possible geometries. 


ELLIPTIC AND HyPERBOLIC GEOMETRIES 


Within the framework of Euclid’s other four postulates 
(and a few that he omitted), there were also possible 
elliptic and hyperbolic geometries. In plane elliptic 
geometry there are no parallels to a given line through a 
given point; it may be viewed as the geometry of a spher- 
ical surface on which antipodal points have been iden- 
tified and all lines are great circles. This was not viewed 
as revolutionary. More exciting was plane hyperbolic 
geometry, developed independently by the Hungarian 
mathematician Janos Bolyai (1802-60) and the Rus- 
sian mathematician Nikolay Lobachevsky (1792-1856), 
in which there is more than one parallel to a given line 
through a given point. This geometry is more difficult 
to visualize, but a helpful model presents the hyper- 
bolic plane as the interior of a circle, in which straight 
lines take the form of arcs of circles perpendicular to 
the circumference. 


sphere hyperbolic 
surface > 


elliptical space 
42at+ 28+ 2y>100° 


hyperbolic space 
4e+ Z8+ Z~*160° 


Euclidean space 
4@+ 28+ Zr =150° 
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Contrasting triangles in Euclidean, elliptic, and hyperbolic 


spaces 


Another way to distinguish the three geometries is 
to look at the sum of the angles of a triangle. It is 180° 
in Euclidean geometry, as first reputedly discovered by 
Thales of Miletus (flourished 6th century BCE), whereas 
it is more than 180° in elliptic geometry and less than 
180° in hyperbolic geometry. 


RIEMANNIAN GEOMETRY 


The discovery that there is more than one geometry was 
of foundational significance and contradicted the Ger- 
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man philosopher Immanuel Kant (1724-1804). Kant 
had argued that there is only one true geometry, Euclid- 
ean, which is known to be true a priori by an inner fac- 
ulty (or intuition) of the mind. For Kant, and practically 
all other philosophers and mathematicians of his time, 
this belief in the unassailable truth of Euclidean geom- 
etry formed the foundation and justification for further 
explorations into the nature of reality. With the discov- 
ery of consistent non-Euclidean geometries, there was 
a subsequent loss of certainty and trust in this innate 
intuition, and this was fundamental in separating math- 
ematics from a rigid adherence to an external sensory 
order (no longer vouchsafed as “true”) and led to the 
growing abstraction of mathematics as a self-contained 
universe. This divorce from geometric intuition added 
impetus to later efforts to rebuild assurance of truth on 
the basis of logic. 

What then is the correct geometry for describing the 
space (actually space-time) we live in? It turns out to be 
none of the above, but a more general kind of geometry, 
as was first discovered by the German mathematician 
Bernhard Riemann (1826-66). In the early 20th cen- 
tury, Albert Einstein showed, in the context of his gen- 
eral theory of relativity, that the true geometry of space 
is only approximately Euclidean. It is a form of Rieman- 
nian geometry in which space and time are linked in a 
four-dimensional manifold, and it is the curvature at each 
point that is responsible for the gravitational “force” at 
that point. Einstein spent the last part of his life trying 
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to extend this idea to the electromagnetic force, hoping 
to reduce all physics to geometry, but a successful unified 
field theory eluded him. 


CANTOR 


In the 19th century, the German mathematician Georg 
Cantor (1845-1918) returned once more to the notion 
of infinity and showed that, surprisingly, there is not 
just one kind of infinity but many kinds. In particu- 
lar, while the set N of natural numbers and the set of 
all subsets of N are both infinite, the latter collection 
is More numerous, in a way that Cantor made precise, 
than the former. He proved that N, Z, and Q all have 
the same size, since it is possible to put them into one- 
to-one correspondence with one another, but that R is 
bigger, having the same size as the set of all subsets 
of N. 

However, Cantor was unable to prove the 
so-called continuum hypothesis, which asserts that there 
is no set that is larger than N yet smaller than the set of its 
subsets. It was shown only in the 20th century, by Gédel 
and the American logician Paul Cohen (1934-2007), 
that the continuum hypothesis can be neither proved nor 
disproved from the usual axioms of set theory. Cantor 
had his detractors, most notably the German mathema- 
tician Leopold Kronecker (1823-91), who felt that Can- 
tor’s theory was too metaphysical and that his methods 
were not sufficiently constructive. 
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THE QUEST FOR RIGOUR 


SET THEORETIC BEGINNINGS 


While laying rigorous foundations for mathematics, 
19th-century mathematicians discovered that the lan- 
guage of mathematics could be reduced to that of set 
theory (developed by Cantor), dealing with member- 
ship (€) and equality (=), together with some rudi- 
mentary arithmetic, containing at least symbols for 
zero (0) and successor (S$). Underlying all this were 
the basic logical concepts: conjunction (A), disjunc- 
tion (V), implication (D), negation (7), and the univer- 
sal (V) and existential (4) quantifiers (formalized by the 
German mathematician Gottlob Frege [1848—1925)). 
(The modern notation owes more to the influence of 
the English logician Bertrand Russell [1872-1970] and 
the Italian mathematician Giuseppe Peano [1858-— 
1932] than to that of Frege.) 

For some time, logicians were obsessed with 
a principle of parsimony, called Ockham’s razor, 
which justified them in reducing the number of 
these fundamental concepts, for example, by defin- 
ing p D q (read p implies g) as 7p V q or even as “(p A 
aq). While this definition, even if unnecessarily cum- 
bersome, is legitimate classically, it is not permitted 
in intuitionistic logic. In the same spirit, many math- 
ematicians adopted the Wiener-Kuratowski definition 
of the ordered pair <a, b> as {{a}, {a, b}}, where {a} 
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is the set whose sole element is a, which disguises its 
true significance. 

Logic had been studied by the ancients, in particular 
by Aristotle and the Stoic philosophers. Philo of Megara 
(flourished c. 250 BCE) had observed (or postulated) 
that p D gis false if and only if p is true and gq 1s false. Yet 
the intimate connection between logic and mathematics 
had to await the insight of 19th-century thinkers, in par- 
ticular Frege. 

Frege was able to explain most mathematical notions 
with the help of his comprehension scheme, which asserts 
that, for every # (formula or statement), there should 
exist a set X such that, for all x, x ¢ X if and only if (x) 
is true. Moreover, by the axiom of extensionality, this 
set X is uniquely determined by (x). A flaw in Frege’s 
system was uncovered by Russell, who pointed out some 
obvious contradictions involving sets that contain them- 
selves as elements—e.g., by taking (x) to be “(x € x). 
Russell illustrated this by what has come to be known 
as the barber paradox: A barber states that he shaves all 
who do not shave themselves. Who shaves the barber? 
Any answer contradicts the barber’s statement. To avoid 
these contradictions Russell introduced the concept 
of types, a hierarchy (not necessarily linear) of elements 
and sets such that definitions always proceed from more 
basic elements (sets) to more inclusive sets, hoping that 
self-referencing and circular definitions would then be 
excluded. With this type distinction, x ¢ X only if X is of 
an appropriate higher type than x. 
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The type theory proposed by Russell, later developed 
in collaboration with the English mathematician Alfred 
North Whitehead (1861—1947) in their monumental Prin- 
cipia Mathematica (1910-13), turned out to be too cum- 
bersome to appeal to mathematicians and logicians, 
who managed to avoid Russell’s paradox in other ways. 
Mathematicians made use of the Neumann-Géodel-Ber- 
nays set theory, which distinguishes between small sets 
and large classes, while logicians preferred an essentially 
equivalent first-order language, the Zermelo-Fraenkel 
axioms, which allow one to construct new sets only as 
subsets of given old sets. Mention should also be made 
of the system of the American philosopher Willard Van 
Orman Quine (1908-2000), which admits a universal set. 
(Cantor had not allowed such a “biggest” set, as the set 
of all its subsets would have to be still bigger.) Although 
type theory was greatly simplified by Alonzo Church and 
the American mathematician Leon Henkin (1921-2006), it 
came into its own only with the advent of category theory. 


FOUNDATIONAL LoGic 


The prominence of logic in foundations led some people, 
referred to as logicists, to suggest that mathematics is a 
branch of logic. The concepts of membership and equal- 
ity could reasonably be incorporated into logic, but what 
about the natural numbers? Kronecker had suggested 
that, while everything else was made by man, the nat- 
ural numbers were given by God. The logicists, however, 
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believed that the natural numbers were also man-made, 
inasmuch as definitions may be said to be of human origin. 

Russell proposed that the number 2 be defined as 
the set of all two-element sets, that is, XY € 2 if and only 
if X has distinct elements x and y and every element 
of X is either x or y. The Hungarian-born American 
mathematician John von Neumann (1903-57) sug- 
gested an even simpler definition, namely that XY e 2 
if and only if ¥ = 0 or X = 1, where 0 is the empty set 
and | is the set consisting of 0 alone. Both definitions 
require an extralogical axiom to make them work—the 
axiom of infinity, which postulates the existence of an 
infinite set. Since the simplest infinite set is the set of 
natural numbers, one cannot really say that arithme- 
tic has been reduced to logic. Most mathematicians 
follow Peano, who preferred to introduce the natural 
numbers directly by postulating the crucial properties 
of 0 and the successor operation S, among which one 
finds the principle of mathematical induction. 

The logicist program might conceivably be saved by 
a 20th-century construction usually ascribed to Church, 
though he had been anticipated by the Austrian philos- 
opher Ludwig Wittgenstein (1889-1951). According to 
Church, the number 2 1s the process of iteration; that 1s, 
2 1s the function which to every function / assigns its iter- 
ate 2(f) = fO f, where (fO f)(x) = f(/(x)). There are some 
type-theoretical difficulties with this construction, but these 
can be overcome if quantification over types is allowed; this 
is finding favour in theoretical computer science. 
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IMPREDICATIVE CONSTRUCTIONS 


A number of 19th-century mathematicians found fault 
with the program of reducing mathematics to arithmetic 
and set theory as suggested by the work of Cantor and 
Frege. In particular, the French mathematician Henri 
Poincaré (1854-1912) objected to impredicative con- 
structions, which construct an entity of a certain type in 
terms of entities of the same or higher type—.e., self-ref- 
erencing constructions and definitions. For example, 
when proving that every bounded nonempty set_X of real 
numbers has a least upper bound a, one proceeds as fol- 
lows. (For this purpose, it will be convenient to think of 
a real number, following Dedekind, as a set of rationals 
that contains all the rationals less than any element of 
the set.) One lets x ¢ aif and only if x € y for some y € X; 
but here y is of the same type as a. 

It would seem that to do ordinary analysis one 
requires impredicative constructions. Russell and White- 
head tried unsuccessfully to base mathematics on a pred- 
icative type theory; but, though reluctant, they had to 
introduce an additional axiom, the axiom of reducibil- 
ity, which rendered their enterprise impredicative after 
all. More recently, the Swedish logician Per Martin-Lof 
presented a new predicative type theory, but no one 
claims that this is adequate for all of classical analysis. 
However, the German-American mathematician Her- 
mann Weyl (1885-1955) and the American mathemati- 
cian Solomon Feferman have shown that impredicative 
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NONCONSTRUCTIVE ARGUMENTS 


Another criticism of the Cantor-Frege program was raised 
by Kronecker, who objected to nonconstructive arguments, 
such as the following proof that there exist irrational num- 
bers a and b such that ab is rational. If 


ie” 


is rational, then the proof is complete; otherwise take 


‘t 
J 


and b = V2, so that ab = 2. The argument is nonconstruc- 
tive, because it does not tell us which alternative holds, even 
though more powerful mathematics will, as was shown by the 
Russian mathematician Aleksandr Osipovich Gelfond (1906— 
68). In the present case, the result can be proved construc- 
tively by taking a = V2 and b = 2log,3. But there are other 
classical theorems for which no constructive proof exists. 
Consider, for example, the statement 


Ax(Ayo(y) > bX), 


which symbolizes the statement that there exists a person 
who is famous if there are any famous people. This can be 
proved with the help of De Morgan’s laws, named after the 
English mathematician and logician Augustus De Morgan 
(1806-71). It asserts the equivalence of 4,0(y) with “V,70(y), 
using classical logic, but there is no way one can construct 
such an x, for example, when $(x) asserts the existence of 
a well-ordering of the reals, as was proved by Feferman. 
An ordered set is said to be well-ordered if every nonempty 
subset has a least element. It had been shown by the Ger- 
man mathematician Ernst Zermelo (1871-1951) that every 
set can be well-ordered, provided one adopts another axiom, 
(Continued on the next page) 
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(Continued from the previous page) 
the axiom of choice, which says that, for every nonempty 
family of nonempty sets, there is a set obtainable by picking 
out exactly one element from each of these sets. This axiom 
is a fertile source of nonconstructive arguments. 


arguments such as the above can often be circumvented 
and are not needed for most, if not all, of analysis. On 
the other hand, as was pointed out by the Italian com- 
puter scientist Giuseppe Longo (born 1947), impred- 
icative constructions are extremely useful in computer 
science—namely, for producing fixpoints (entities that 
remain unchanged under a given process). 


INTUITIONISTIC LoGic 


The Dutch mathematician L.E.J. Brouwer (1881-1966) 
in the early 20th century had the fundamental insight that 
such nonconstructive arguments will be avoided if one 
abandons a principle of classical logic which lies behind 
De Morgan’s laws. This is the principle of the excluded 
third (or excluded middle), which asserts that, for every 
proposition p, either p or not p; and equivalently that, 
for every p, not not p implies p. This principle is basic to 
classical logic and had already been enunciated by Aris- 
totle, though with some reservations, as he pointed out 
that the statement “there will be a sea battle tomorrow” 
is neither true nor false. 

Brouwer did not claim that the principle of the 
excluded third always fails, only that 1t may fail in the 
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presence of infinite sets. Of two natural num- 
bers x and y one can always decide whether x = yorx#y, 
but of two real numbers this may not be possible, as one 
might have to know an infinite number of digits of their 
decimal expansions. Similar objections apply to De Mor- 
gan’s laws, a consequence of the principle of the excluded 
third. For a finite set A, if it has been shown that the 
assertion Vxe A70(x) leads to a contradiction, 4yeye(x) 
can be verified by looking at each element of 4 in turn; 
1.e., the statement that no members of a given set have a 
certain property can be disproved by examining in turn 
each element of the set. For an infinite set A, there is no 
way in which such an inspection can be carried out. 

Brouwer’s philosophy of mathematics 1s called intu- 
itionism. Although Brouwer himself felt that mathemat- 
ics was language-independent, his disciple Arend Heyt- 
ing (1898-1980) set up a formal language for first-order 
intuitionistic arithmetic. Some of Brouwer’s later follow- 
ers even studied intuitionistic type theory, which differs 
from classical type theory only by the absence of a single 
axiom (double negation): 


Vxe O(77xDx), 


where 0) is the type of truth-values. 

While it cannot be said that many practicing mathema- 
ticians have followed Brouwer in rejecting this principle on 
philosophical grounds, it came as a great surprise to people 
working in category theory that certain important catego- 
ries called topoi (singular: topos) have associated with them 
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a language that is intuitionistic in general. In consequence 
of this fact, a theorem about sets proved constructively was 
immediately seen to be valid not only for sets but also for 
sheaves, which, however, lie beyond the scope of this article. 

The moderate form of intuitionism considered here 
embraces Kronecker’s constructivism but not the more 
extreme position of finitism. According to this view, 
which goes back to Aristotle, infinite sets do not exist, 
except potentially. In fact, it is precisely in the presence 
of infinite sets that intuitionists drop the classical princi- 
ple of the excluded third. 

An even more extreme position, called ultrafinitism, 
maintains that even very large numbers do not exist, say 
numbers greater than 10010" Of course, the vast major- 
ity of mathematicians reject this view by referring to 
190" + 1, but the true believers have subtle ways of 
getting around this objection, which, however, lie beyond 
the scope of this discussion. 


OTHER Loacics 


While intuitionistic logic is obtained from classical logic by 
dropping the principle of the excluded third, other logics 
have also been proposed, though none has had a compa- 
rable impact on the foundations of mathematics. One may 
mention many-valued, or multivalued, logics, which admit a 
finite number of truth-values; fuzzy logic, with an imprecise 
membership relationship (though, paradoxically, a precise 
equality relation); and quantum logic, where conjunction 
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may be only partially defined and implication may not be 
defined at all. Perhaps more important have been various 
so-called substructural logics in which the usual properties 
of the deduction symbol are weakened: relevance logic is 
studied by philosophers, linear logic by computer scientists, 
and a noncommutative version of the latter by linguists. 


FORMALISM 


Russell’s discovery of a hidden contradiction in Frege’s 
attempt to formalize set theory, with the help of his sim- 
ple comprehension scheme, caused some mathematicians 
to wonder how one could make sure that no other con- 
tradictions existed. Hilbert’s program, called formalism, 
was to concentrate on the formal language of mathemat- 
ics and to study its syntax. In particular, the consistency 
of mathematics, which may be taken, for instance, to be 
the metamathematical assertion that the mathematical 
statement 0 = | is not provable, ought to be a metatheo- 
rem—that 1s, provable within the syntax of mathematics. 
This formalization project made sense only if the syn- 
tax of mathematics was consistent, for otherwise every 
syntactical statement would be provable, including that 
which asserts the consistency of mathematics. 
Unfortunately, a consequence of Géddel’s incom- 
pleteness theorem is that the consistency of mathematics 
can be proved only in a language which 1s stronger than 
the language of mathematics itself. Yet, formalism is not 
dead—in fact, most pure mathematicians are tacit for- 
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malists—but the naive attempt to prove the consistency 
of mathematics in a weaker system had to be abandoned. 
While no one, except an extremist intuitionist, will 
deny the importance of the language of mathematics, most 
mathematicians are also philosophical realists who believe 
that the words of this language denote entities in the real 
world. Following the Swiss mathematician Paul Bernays 
(1888-1977), this position is also called Platonism, since 
Plato believed that mathematical entities really exist. 


GODEL 


Implicit in Hilbert’s program had been the hope that the 
syntactic notion of provability would capture the seman- 
tic notion of truth. Gddel came up with the surprising 
discovery that this was not the case for type theory and 
related languages adequate for arithmetic, as long as the 
following assumptions are insisted upon: 


1. The set of theorems (provable statements) is effec- 
tively enumerable, by virtue of the notion of proof 
being decidable. 


2. The set of true statements of mathematics is 
w-complete in the following sense: given any for- 
mula (x), containing a free variable x of type N, 
the universal statement Vyey@(x) will be true if 
(n) is true for each numeral n—that is, for n = 
0,n = SO, n = SSO, and so on. 


3. The language is consistent. 
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Actually, Godel also made a somewhat stronger 
assumption, which, as the American mathematician 
J. Barkley Rosser later showed, could be replaced by 
assuming consistency. Gddel’s ingenious argument was 
based on the observation that syntactical statements 
about the language of mathematics can be translated 
into statements of arithmetic, hence into the language 
of mathematics. It was partly inspired by an argument 
that supposedly goes back to the ancient Greeks and 
which went something like this: Epimenides says that 
all Cretans are liars; Epimenides is a Cretan; hence Epi- 
menides is a liar. Under the assumptions | and 2, Godel 
constructed a mathematical statement g that is true but 
not provable. If it is assumed that all theorems are true, 
it follows that neither g nor ~¢g is a theorem. 

No mathematician doubts assumption 1; by looking 
at a purported proof of a theorem, suitably formalized, 
it is possible for a mathematician, or even a computer, 
to tell whether it is a proof. By listing all proofs in, say, 
alphabetic order, an effective enumeration of all theo- 
rems is obtained. Classical mathematicians also accept 
assumption 2 and therefore reluctantly agree with Godel 
that, contrary to Hilbert’s expectation, there are true 
mathematical statements which are not provable. 

However, moderate intuitionists could draw a dif- 
ferent conclusion, because they are not committed to 
assumption 2. To them, the truth of the universal state- 
ment Vye,@(x) can be known only if the truth of (7) is 
known, for each natural number n, in a uniform way. This 
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would not be the case, for example, if the proof of @(n) 
increases in difficulty, hence in length, with n. Moderate 
intuitionists might therefore identify truth with prov- 
ability and not be bothered by the fact that neither g nor 
“g is true, as they would not believe in the principle of 
the excluded third in the first place. 

Intuitionists have always believed that, for a state- 
ment to be true, its truth must be knowable. Moreover, 
moderate intuitionists might concede to formalists that 
to say that a statement is known to be true is to say 
that it has been proved. Still, some intuitionists do not 
accept the above argument. Claiming that mathematics 
is language-independent, intuitionists would state that in 
Godel’s metamathematical proof of his incompleteness 
theorem, citing m-completeness to establish the truth of 
a universal statement yields a uniform proof of the latter 
after all. 

Godel considered himself to be a Platonist, inas- 
much as he believed in a notion of absolute truth. He 
took it for granted, as do many mathematicians, that the 
set of true statements is w-complete. Other logicians are 
more skeptical and want to replace the notion of truth 
by that of truth in a model. In fact, Godel himself, in his 
completeness theorem, had shown that for a mathemati- 
cal statement to be provable it is necessary and sufficient 
that it be true in every model. His incompleteness theo- 
rem now showed that truth in every w-complete model is 
not sufficient for provability. This point will be returned 
to later, as the notion of model for type theory is most 
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easily formulated with the help of category theory, 
although this is not the way Gédel himself proceeded. 


RECURSIVE DEFINITIONS 


Peano had observed that addition of natural numbers 
can be defined recursively thus: 


x+Q0=x,x + Sy = Sx +y). 

Other numerical functions N° > N that can be 
defined with the help of such a recursion scheme (and 
with the help of 0, S, and substitution) are called prim- 
itive recursive. Gddel used this concept to make precise 
what he meant by “effectively enumerable.” A set of 
natural numbers is said to be recursively enumerable if 
it consists of all fm) with n e N, where f: N- Nisa 
primitive recursive function. 

This notion can easily be extended to subsets 
of N° and, by a simple trick called arithmetization, to 
sets of strings of words in a language. Thus Gédel was 
able to assert that the set of theorems of mathematics is 
recursively enumerable, and, more recently, the Ameri- 
can linguist Noam Chomsky (born 1928) could say that 
the set of grammatical sentences of a natural language, 
such as English, is recursively enumerable. 

It is not difficult to show that all primitive recur- 
sive functions can be calculated. For example, to calcu- 
late x + y when x = 3 and y = 2, making use of Peano’s 
recursive definition of x + y and of the definitions 1 
= SO, 2 = S1, and so on, one proceeds as follows: 
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34+2=S2+ Sl =S(S2+ 1) = S(S2 +S0) 


= S(S2 + 0) = $2 = SS3 = S4= 5. 

But primitive recursive functions are not the only 
numerical functions that can be calculated. More general 
are the recursive functions, where f: N > N is said to be 
recursive if its graph is recursively enumerable—that is, if 
there exist primitive recursive functions u, v : N— Nsuch 
that, for all natural numbers x and y, y = f(x) if and only 
if, for some z € N, x = u(z) and y = v(z). 

All recursive functions can be calculated with pencil 
and paper or, even more primitively, by moving pebbles 
(calculi in Latin) from one location to another, using some 
finite set of instructions, nowadays called a program. 
Conversely, only recursive functions can be so calculated, 
or computed by a theoretical machine introduced by the 
English mathematician Alan Turing (1912-54), or by a 
modern computer, for that matter. The Church-Turing 
thesis asserts that the informal notion of calculability is 
completely captured by the formal notion of recursive 
functions and hence, in theory, replicable by a machine. 

Gédel’s incompleteness theorem had proved that 
any useful formal mathematical system will contain unde- 
cidable propositions—propositions which can be neither 
proved nor disproved. Church and Turing, while seeking 
an algorithmic (mechanical) test for deciding theorem- 
hood and thus potentially deleting nontheorems, proved 
independently, in 1936, that such an algorithmic method 
was impossible for the first-order predicate logic. The 
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Church-Turing theorem of undecidability, combined 
with the related result of the Polish-born American math- 
ematician Alfred Tarski (1902-83) on undecidability of 
truth, eliminated the possibility of a purely mechanical 
device replacing mathematicians. 


CompuTERS AND PROOF 


While many mathematicians use computers only as word 
processors and for the purpose of communication, com- 
puter-assisted computations can be useful for discovering 
potential theorems. For example, the prime number the- 
orem was first suggested as the result of extensive hand 
calculations on the prime numbers up to 3,000,000 by the 
Swiss mathematician Leonhard Euler (1707-83), a process 
that would have been greatly facilitated by the availability 
of a modern computer. Computers may also be helpful 
in completing proofs when there are a large number of 
cases to be considered. The renowned computer-aided 
proof of the four-colour mapping theorem by the Amer- 
ican mathematicians Kenneth Appel (1932-2013) and 
Wolfgang Haken (born 1928) even goes beyond this, as 
the computer helped to determine which cases were to be 
considered in the next step of the proof. Yet, in principle, 
computers cannot be asked to discover proofs, except in 
very restricted areas of mathematics—such as elementary 
Euclidean geometry—where the set of theorems happens 
to be recursive, as was proved by Tarski. 

As the result of earlier investigations by Turing, 
Church, the American mathematician Haskell Brooks 
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Curry (1900-82), and others, computer science has 
itself become a branch of mathematics. Thus, in theo- 
retical computer science, the objects of study are not 
just theorems but also their proofs, as well as calcu- 
lations, programs, and algorithms. Theoretical com- 
puter science turns out to have a close connection to 
category theory. 


CATEGORY THEORY 


ABSTRACTION IN MATHEMATICS 


One recent tendency in the development of mathematics 
has been the gradual process of abstraction. The Norwe- 
gian mathematician Niels Henrik Abel (1802-29) proved 
that equations of the fifth degree cannot, in general, be 
solved by radicals. The French mathematician Evariste 
Galois (1811-32), motivated in part by Abel’s work, 
introduced certain groups of permutations to determine 
the necessary conditions for a polynomial equation to be 
solvable. These concrete groups soon gave rise to abstract 
groups, which were described axiomatically. Then it was 
realized that to study groups it was necessary to look at 
the relation between different groups—in particular, at 
the homomorphisms which map one group into another 
while preserving the group operations. Thus people 
began to study what is now called the concrete category 
of groups, whose objects are groups and whose arrows 
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are homomorphisms. It did not take long for concrete 
categories to be replaced by abstract categories, again 
described axiomatically. 

The important notion of a category was intro- 
duced by Samuel Eilenberg and Saunders Mac Lane 
at the end of World War II. These modern categories 
must be distinguished from Aristotle’s categories, 
which are better called types in the present context. A 
category has not only objects but also arrows (referred 
to also as morphisms, transformations, or mappings) 
between them. 

Many categories have as objects sets endowed with 
some structure and arrows, which preserve this structure. 
Thus, there exist the categories of sets (with empty struc- 
ture) and mappings, of groups and group-homomor- 
phisms, of rings and ring-homomorphisms, of vector 
spaces and linear transformations, of topological spaces 
and continuous mappings, and so on. There even exists, 
at a still more abstract level, the category of (small) cat- 
egories and functors, as the morphisms between catego- 
ries are called, which preserve relationships among the 
objects and arrows. 

Not all categories can be viewed in this con- 
crete way. For example, the formulas of a deductive 
system may be seen as objects of a category whose 
arrows f: A > B are deductions of B from A. In fact, 
this point of view is important in theoretical computer 
science, where formulas are thought of as types and 
deductions as operations. 
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More formally, a category consists of (1) a collection 
of objects A, B, C,..., (2) for each ordered pair of objects 
in the collection an associated collection of transforma- 
tions including the identity 1 : A > A, and (3) an associ- 
ated law of composition for each ordered triple of objects 
in the category such that for f: A > Band g : B— Cthe 
composition gf (or g O /f) is a transformation 
from A to C—.e., gf: A > C. Additionally, the associative 
law and the identities are required to hold (where the com- 
positions are defined)—.e., h(gf) = (hg)fand 1,f=f= fla. 

In a sense, the objects of an abstract category have 
no windows, like the monads of Leibniz. To infer the 
interior of an object A one need only look at all the 
arrows from other objects to A. For example, in the cat- 
egory of sets, elements of a set A may be represented by 
arrows from a typical one-element set into A. Similarly, 
in the category of small categories, if 7 is the category 
with one object and no nonidentity arrows, the objects of 
a category 4 may be identified with the functors 7 > A. 
Moreover, if 21s the category with two objects and one 
nonidentity arrow, the arrows of 4 may be identified 
with the functors 2?— A. 


ISOMORPHIC STRUCTURES 


An arrow f : A > Bis called an isomorphism if there 
is an arrow g : B > A inverse to f—that is, such 
that g O f= laand fO g = 1,. This is written A & B, 
and A and B are called isomorphic, meaning that they 
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have essentially the same structure and that there is no 
need to distinguish between them. Inasmuch as mathe- 
matical entities are objects of categories, they are given 
only up to isomorphism. Their traditional set-theoret- 
ical constructions, aside from serving a useful purpose 
in showing consistency, are really irrelevant. 

For example, in the usual construction of 
the ring of integers, an integer is defined as an equivalence 
class of pairs (m, n) of natural numbers, where (m,n) is 
equivalent to (m’, n’) if and only if m + n' = m' +n. The 
idea is that the equivalence class of (m,n) 1s to be viewed 
as m — n. What is important to a categorist, however, 
is that the ring Z of integers is an initial object in the 
category of rings and homomorphisms—that is, that for 
every ring R there is a unique homomorphism Z —> R. 
Seen in this way, Z is given only up to isomorphism. In 
the same spirit, it should be said not that Z is contained 
in the field Q of rational numbers but only that the 
homomorphism Z > Q is one-to-one. Likewise, it makes 
no sense to speak of the set-theoretical intersection of 
n and y-1, if both are expressed as sets of sets of sets 
(CE infinitum). 

Of special interest in foundations and elsewhere 
are adjoint functors (FG). These are pairs of functors 
between two categories # and &, which go in opposite 
directions such that a one-to-one correspondence exists 
between the set of arrows F(A) > Bin & and the set 
of arrows A > G(B) in «#—that is, such that the sets 
are isomorphic. 
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Topos THEORY 


The original purpose of category theory had been 
to make precise certain technical notions of algebra 
and topology and to present crucial results of diver- 
gent mathematical fields in an elegant and uniform 
way, but it soon became clear that categories had an 
important role to play in the foundations of mathe- 
matics. This observation was largely the contribu- 
tion of the American mathematician F.W. Lawvere 
(born 1937), who elaborated on the seminal work of 
the German-born French mathematician Alexandre 
Grothendieck (1928-2014) in algebraic geometry. At 
one time he considered using the category of (small) 
categories (and functors) itself for the foundations of 
mathematics. Though he did not abandon this idea, 
later he proposed a generalization of the category of 
sets (and mappings) instead. 

Among the properties of the category of sets, Law- 
vere singled out certain crucial ones, only two of which 
are mentioned here: 


1. There is a one-to-one correspondence between 
subsets B of A and their characteristic functions y 
: A > {true, false}, where, for each element a of A, 
x(a) = true if and only if ais in B. 


2. Given an element a of A anda functionh : A> A, 


there is a unique function f : N > A such that f(n) 
=h'(a). 
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Suitably axiomatized, a category with these 
properties is called an (elementary) topos. However, 
in general, the two-element set {true, false} must be 
replaced by an object 0 with more than two truth-val- 
ues, though a distinguished arrow into 021 is still labeled 
as true. 


INTUITIONISTIC TYPE THEORIES 


Topoi are closely related to intuitionistic type theories. 
Such a theory is equipped with certain types, terms, 
and theorems. 

Among the types there should be a type © for 
truth-values, a type N for natural numbers, and, for 
each type A, a type go(A) for all sets of entities of 
type A. 

Among the terms there should be in particular: 


1. The formulas a = a' andaeaof type Q, if aand a’ 
are of type A and a is of type go(A) 


2. The numerals 0 and Sv of type N, if the numeral 71s 
of type N 


3. The comprehension term {x € Alh(x)} of type 
go(A), if (x) 1s a formula of type 0 containing a 
free variable x of type A. 


The set of theorems should contain certain obvious axi- 
oms and be closed under certain obvious rules of infer- 
ence, neither of which will be spelled out here. 
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At this point the reader may wonder what happened 
to the usual logical symbols. These can all be defined— 
for example, universal quantification 


VP) as {x € Alp(x)} = {x € Alx = x} 


and disjunction 


pVvgasvV.((pIHNIdI(qDH D0). 

In general, the set of theorems will not be recur- 
sively enumerable. However, this will be the case for pure 
intuitionistic type theory Z,, in which types, terms, and 
theorems are all defined inductively. In Y, there are no 
types, terms, or theorems other than those that follow 
from the definition of type theory. Y, is adequate for 
the constructive part of the usual elementary mathe- 
matics—arithmetic and analysis—but not for meta- 
mathematics, if this is to include a proof of Gédel’s 
completeness theorem, and not for category theory, if 
this is to include the Yoneda embedding of a small cat- 
egory into a set-valued functor category. 


INTERNAL LANGUAGE 


It turns out that each topos £ has an internal lan- 
guage L( ), an intuitionistic type theory whose types 
are objects and whose terms are arrows of &. Con- 
versely, every type theory ¥ generates a topos T(Y), 
by the device of turning (equivalence classes of) terms 
into objects, which may be thought of as denoting sets. 
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Nominalists may be pleased to note that every topos 
= is equivalent (in the sense of category theory) to the 
topos generated by a language—namely, the internal lan- 
guage of #. On the other hand, Platonists may observe 
that every type theory & has a conservative extension 
to the internal language of a topos—namely, the topos 
generated by Y, assuming that this topos exists in the 
real (ideal) world. Here, the phrase “conservative exten- 
sion” means that Y can be extended to L7(#) without 
creating new theorems. The types of L7(H) are names 
of sets in Y and the terms of L7(L) may be identified 
with names of sets in Y for which it can be proved that 
they have exactly one element. This last observation pro- 
vides a categorical version of Russell’s theory of descrip- 
tions: if one can prove the unique existence of an x of 
type A in Y such that p(x), then this unique x has a name 
i CZ), 

The interpretation of a type theory Y in a topos 
= means an arrow Y > L(= ) in the category of type theo- 
ries or, equivalently, an arrow T(L) > = in the category of 
topoi. Indeed, 7 and L constitute a pair of adjoint functors. 


GODEL AND CATEGORY THEORY 


It is now possible to reexamine Gédel’s theorems from 
a categorical point of view. In a sense, every inter- 
pretation of F in a topos & may be considered as a 
model of &, but this notion of model is too general, 
for example, when compared with the models of clas- 


THE FOUNDATIONS OF MATHEMATICS 
te 


sical type theories et by Henkin. Therefore, it is 
preferable to restrict #* to being a special ie of topos 
called eee Given an arrow p into 2 in £, then, p is 
true in & if p coincides with the arrow ae in &, or, 
equivalently, if p is a theorem in the internal language 
of #. & is eae a local topos oe that (1) 0 = 1 
is not true in“, (2) p V q is true in“ only if pis true in 
2 or qis true in =’, and (3) Jc) is true in & 2 if 
()(a) is true in & for some arrow a: 1— Ain =. Here 
the statement 0 = | in provision | can be replaced by 
any other contradiction—e.g., by V,_ ,t, which says that 
every proposition is true. 

A moe of Y is an interpretation of in a local 
topos £. Gédel’s completeness theorem, generalized to 
intuitionistic type theory, may now be stated as follows: 
A closed formula of # is a theorem if and only if it is 
true in every model of Y. 

Gédel’s incompleteness theorem, generalized like- 
wise, says that, in the usual language of arithmetic, it is 
not enough to look only at w-complete models: Assuming 
that Y is consistent and that the theorems of Y are recur- 
sively enumerable, with the help of a decidable notion of 
proof, there is a closed formula g in Y, which is true in 
every w-complete model, yet gis not a theorem in Y. 


THE SEARCH FOR A DISTINGUISHED MODEL 


A Platonist might still ask whether, among all the models 
of the language of mathematics, there is a distinguished 
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model, which may be considered to be the world of 
mathematics. Take as the language Y, pure intuitionistic 
type theory. It turns out, somewhat surprisingly, that the 
topos generated by Y, is a local topos; hence, the unique 
interpretation of Y, in the topos generated by it may 
serve as a distinguished model. 

This so-called free topos has been constructed 
linguistically to satisfy any formalist, but it should 
also satisfy a moderate Platonist, one who is willing 
to abandon the principle of the excluded third, inas- 
much as the free topos is the initial object in the cate- 
gory of all topoi. Hence, the free topos may be viewed, 
in the words of Leibniz, as the best of all possible 
worlds. More modestly speaking, the free topos is to 
an arbitrary topos like the ring of integers is to an 
arbitrary ring. 

The language Y, should also satisfy any construc- 
tivist: if an existential statement 4 _ p(x) can be proved 
in Z,, then p(a) can be proved for some term a of type A; 
moreover, if p V g can be proved, then either p can be 
proved or g can be proved. 

The above argument would seem to make a strong 
case for the acceptance of pure intuitionistic type theory 
as the language of elementary mathematics—that is, of 
arithmetic and analysis—and hence for the acceptance 
of the free topos as the world of mathematics. Nonethe- 
less, most practicing mathematicians prefer to stick to 
classical mathematics. In fact, classical arguments seem 
to be necessary for metamathematics—for example, in 
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the usual proof of Gédel’s completeness theorem—even 
for intuitionistic type theory. 

In this connection, one celebrated consequence 
of Gédel’s incompleteness theorem may be recalled, 
to wit: the consistency of Y cannot be proved (via 
arithmetization) within &. This is not to say that it 
cannot be proved in a stronger metalanguage. Indeed, 
to exhibit a single model of HY would constitute such 
a proof. 

It is more difficult to make a case for the clas- 
sical world of mathematics, although this is what 
most mathematicians believe in. This ought to be a 
distinguished model of pure classical type theory 7. 
Unfortunately, Godel’s argument shows that the inter- 
pretation of &, in the topos generated by it is not a 
model in this sense. 


BooLeAN Loca. Topo! 


A topos ts said to be Boolean if its internal language is 
classical. It is named after the English mathematician 
George Boole (1815-64), who was the first to give an 
algebraic presentation of the classical calculus of prop- 
ositions. A Boolean topos is local under the following 
circumstances. The disjunction property (2) holds in 
a Boolean topos if and only if, for every closed for- 
mula p, either p is true or 7p is true. Moreover, with the 
help of De Morgan’s laws, the existence property (3) 
may then be rephrased thus: if (qa) is true for all closed 
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terms a of type A, then V__ ,p(x) is true. As it turns out, 
a Boolean local topos may be described more simply, 
without referring to the internal language, as a topos 
with the following property: if f, g: A — B are arrows 
such that fa = ga for alla: 1 > A, then f= g. (Here | 1s 
the so-called terminal object, with the property that, for 
each object C, there is a unique arrow C — 1.) For the 
Boolean topos to be w-complete requires furthermore 
that all numerals—that is, closed terms of type N in its 
internal language—be standard—that is, have the form 
0, SO, SSO, and so on. 

Of course, Gddel’s completeness theorem shows 
that there are plenty of Boolean local topoi to model 
pure classical type theory in, but the usual proof of 
their existence requires nonconstructive arguments. 
It would be interesting to exhibit at least one such 
model constructively. 

As a first step toward constructing a distinguished 
w-complete Boolean model of Z, one might wish to define 
the notion of truth in #,, as induced by this model. Tar- 
ski had shown how truth can be defined for classical 
first-order arithmetic, a language that admits, aside from 
formulas, only terms of type N. Tarski achieved this 
essentially by incorporating w-completeness into the defi- 
nition of truth. It is not obvious whether his method can 
be extended to classical higher-order arithmetic—that 1s, 
to classical type theory. In fact, Tarski himself showed 
that the notion of truth is not definable (in a technical 
sense) in such a system. If his notion of definability cor- 
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responds to what is here meant by constructibility, then 
it is possible to conclude that, indeed, no Boolean model 
can be constructed. 

One may be tempted to consider as a candidate for 
the distinguished Boolean local topos the so-called von 
Neumann universe. This is defined as the union of a 
class of sets containing the empty set (the initial object 
in the category of sets) and closed under the power-set 
operation and under transfinite unions—thus, as a sub- 
category of the category of sets. But what is the cate- 
gory of sets if not the distinguished Boolean local topos 
being sought? 

A better candidate may be Gédel’s construct- 
ible universe, whose original purpose was to serve as 
a model of Zermelo-Fraenkel set theory in which the 
continuum hypothesis holds. It is formed like the von 
Neumann universe, except that the notion of subset, 
implicit in the power-set operation, is replaced by that 
of definable subset. Is it possible that this universe can 
be constructed syntactically, like the free topos, without 
reference to any previously given category of sets, or by 
a universal property? 

In the internal language of a Boolean local topos, 
the logical connectives and quantifiers have their natural 
meanings. In particular, quantifiers admit a substitu- 
tional interpretation, a desirable property that has been 
discussed by philosophers (among them, Russell and the 
American logician Saul Kripke [born 1940])—to wit: if 
an existential statement is true, then it can be witnessed 
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by a term of appropriate type in the language; and a uni- 
versal statement is true if it is witnessed by all terms of 
the appropriate type. 

Note that, in the internal language of the free topos, 
and therefore in pure intuitionistic type theory, the sub- 
stitutional interpretation is valid for existential quan- 
tifiers, by virtue of the free topos being local, but that 
it fails for universal quantifiers, in view of the absence 
of @-completeness and the fact that in the free topos 
all numerals are standard. For a Boolean local topos, 
@-completeness will also ensure that all numerals are 
standard, so that numerals mean exactly what they are 
intended to mean. 


One DistTINGUISHED Mopet or Many Mobpe ts 


Some mathematicians do not believe that a distin- 
guished world of mathematics should be sought at all, 
but rather that the multiplicity of such worlds should 
be looked at simultaneously. A major result in alge- 
braic geometry, due to Alexandre Grothendieck, was 
the observation that every commutative ring may be 
viewed as a continuously variable local ring, as Lawvere 
would put it. In the same spirit, an amplified version 
of Goédel’s completeness theorem would say that every 
topos may be viewed as a continuously variable local 
topos, provided sufficiently many variables (Henkin 
constants) are adjoined to its internal language. Put in 
more technical language, this makes the possible worlds 
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of mathematics stalks of a sheaf. However, the ques- 
tion still remains as to where this sheaf lives if not in a 
distinguished world of mathematics or—perhaps better 
to say—metamathematics. 

These observations suggest that the foundations of 
mathematics have not achieved a definitive shape but are 
still evolving; they form the subject of a lively debate 
among a small group of interested mathematicians, logi- 
cians, and philosophers. 


Chapter 2 


ANCIENT WESTERN 
MATHEMATICS 


ntil the 1920s it was commonly supposed that 

| mathematics had its birth among the ancient 

Greeks. What was known of earlier traditions, 

such as the Egyptian as represented by the Rhind papy- 

rus (edited for the first time only in 1877), offered at 

best a meagre precedent. This impression gave way to 

a very different view as historians succeeded in deci- 

phering and interpreting the technical materials from 
ancient Mesopotamia. 


Owing to the durability of the Mesopotamian scribes’ clay 
tablets, the surviving evidence of this culture is substan- 
tial. Existing specimens of mathematics represent all the 
major eras—the Sumerian kingdoms of the 3rd millen- 
nium BCE, the Akkadian and Babylonian regimes (2nd 
millennium), and the empires of the Assyrians (early Ist 
millennium), Persians (6th through 4th centuries BCE), 
and Greeks (3rd century BCE to Ist century CE). The 
level of competence was already high as early as the Old 
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Babylonian dynasty, the time of the lawgiver-king Ham- 
murabi (c. 18th century BCE), but after that there were 
few notable advances. The application of mathematics 
to astronomy, however, flourished during the Persian 
and Seleucid (Greek) periods. 


THE NUMERAL SYSTEM AND ARITHMETIC OPERATIONS 


Unlike the Egyptians, the mathematicians of the Old 
Babylonian period went far beyond the immediate chal- 
lenges of their official accounting duties. For example, 
they introduced a versatile numeral system, which, like 
the modern system, exploited the notion of place value, 
and they developed computational methods that took 
advantage of this means of expressing numbers; they 
solved linear and quadratic problems by methods much 
like those now used in algebra; their success with the 
study of what are now called Pythagorean number tri- 
ples was a remarkable feat in number theory. The scribes 
who made such discoveries must have believed mathe- 
matics to be worthy of study in its own right, not just as 
a practical tool. 

The older Sumerian system of numerals followed 
an additive decimal (base-10) principle similar to that of 
the Egyptians. But the Old Babylonian system converted 
this into a place-value system with the base of 60 (sex- 
agesimal). The reasons for the choice of 60 are obscure, 
but one good mathematical reason might have been the 
existence of so many divisors (2, 3, 4, and 5, and some 
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multiples) of the base, which would have greatly facili- 
tated the operation of division. For numbers from | to 
59, the symbols T for 1 and 4 for 10 were combined in the 
simple additive manner (e.g., 44 4 TT represented 32). 
But, to express larger values, the Babylonians applied the 
concept of place value: for example, 60 was written as 
70 as T -¢ 80as | & {, and so on. In fact, T could rep- 
resent any power of 60. The context determined which 
power was intended. The Babylonians appear to have 
developed a placeholder symbol that functioned as a 
zero by the 3rd century BCE, but its precise meaning and 
use is still uncertain. Furthermore, they had no mark to 
separate numbers into integral and fractional parts (as 
with the modern decimal point). Thus, the three-place 
numeral > 7 30 could represent 3'/, (i.e, 3 + 7/60 + 
30/60°), 1871, (ie.,3 % 60 + 7 + 30/60), 11,250 (.e., 3 x 
60 +7 x 60 i 30), or a multiple of these numbers by any 
power of 60. 

The four arithmetic operations were performed in 
the same way as in the modern decimal system, except 
that carrying occurred whenever a sum reached 60 
rather than 10. Multiplication was facilitated by means 
of tables; one typical tablet lists the multiples of a num- 
ber by 1, 2, 3,..., 19, 20, 30, 40, and 50. To multiply two 
numbers several places long, the scribe first broke the 
problem down into several multiplications, each by a 
one-place number, and then looked up the value of each 
product in the appropriate tables. He found the answer 
to the problem by adding up these intermediate results. 
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These tables also assisted in division, for the values that 
head them were all reciprocals of regular numbers. 

Regular numbers are those whose prime factors divide 
the base; the reciprocals of such numbers thus have only a 
finite number of places (by contrast, the reciprocals of non- 
regular numbers produce an infinitely repeating numeral). 
In base 10, for example, only numbers with factors of 2 and 
5 (e.g., 8 or 50) are regular, and the reciprocals (1/8 = 0.125, 
1/50 = 0.02) have finite expressions; but the reciprocals of 
other numbers (such as 3 and 7) repeat infinitely 


O14e2857 


, respectively, where the bar indicates the digits that con- 
tinually repeat). In base 60, only numbers with factors 
of 2, 3, and 5 are regular; for example, 6 and 54 are reg- 
ular, so that their reciprocals (10 and 1 6 40) are finite. 
The entries in the multiplication table for 1 6 40 are thus 
simultaneously multiples of its reciprocal 1/54. To divide 
a number by any regular number, then, one can consult 
the table of multiples for its reciprocal. 

An interesting tablet in the collection of Yale Uni- 
versity shows a square with its diagonals; on one side is 
written “30,” under one diagonal “42 25 35,” and right 
along the same diagonal “1 24 51 10” (Le., 1 + 24/60 + 
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51/60 + 10/60). This third number is the correct value 
of \2 to four sexagesimal places (equivalent in the dec- 
imal system to 1.414213..., which is too low by only 1 
in the seventh place), while the second number is the 
product of the third number and the first and so gives 
the length of the diagonal when the side is 30. The scribe 
thus appears to have known an equivalent of the famil- 
iar long method of finding square roots. An additional 
element of sophistication is that, by choosing 30 (that is, 
1/2) for the side, the scribe obtained as the diagonal the 
reciprocal of the value of 2 (since y2/2 = 1/V2), a result 
useful for purposes of division. 


GEOMETRIC AND ALGEBRAIC PROBLEMS 


In a Babylonian tablet now in Berlin, the diagonal of 
a rectangle of sides 40 and 10 is solved as 40 + 10°1(2 
x 40). Here a very effective approximating rule is being 
used (that the square root of the sum of a +b can be 
estimated as a + b /2a), the same rule found frequently 
in later Greek geometric writings. Both these examples 
for roots illustrate the Babylonians’ arithmetic approach 
in geometry. They also show that the Babylonians were 
aware of the relation between the hypotenuse and the 
two legs of a right triangle (now commonly known as 
the Pythagorean theorem) more than a thousand years 
before the Greeks used it. 

A type of problem that occurs frequently in the 
Babylonian tablets seeks the base and height of a rectan- 


Babylonian mathematical tablet 


gle, where their product and sum have specified values. 
From the given information the scribe worked out the 
difference, since 


(b—h) =(b +h) — 4bh. 
In the same way, if the product and difference were given, 
the sum could be found. And, once both the sum and 
difference were known, each side could be determined, 
for 


2b = (b +h) + (b—h) and 2h = (b +h) - (b—h). 


This procedure is equivalent to a solution of the gen- 
eral quadratic in one unknown. In some places, however, 
the Babylonian scribes solved quadratic problems in terms 
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of asingle unknown, just as would now be done by means 
of the quadratic formula. 

Although these Babylonian quadratic procedures 
have often been described as the earliest appearance 
of algebra, there are important distinctions. The scribes 
lacked an algebraic symbolism; although they must cer- 
tainly have understood that their solution procedures 
were general, they always presented them in terms of 
particular cases, rather than as the working through of 
general formulas and identities. They thus lacked the 
means for presenting general derivations and proofs of 


PYTHAGOREAN THEOREM 


The Pythagorean theorem is the well-known geometric theo- 
rem that the sum of the squares on the legs of a right triangle 
is equal to the square on the hypotenuse (the side opposite 
the right angle)—or, in familiar algebraic notation, at+b=c. 
Although the theorem has long been associated with the 
Greek mathematician-philosopher Pythagoras (c. 5/70- 
500/490 BCE), it is actually far older. Four Babylonian tablets 
from circa 1900-1600 BCE indicate some knowledge of the 
theorem, or at least of special integers known as Pythago- 
rean triples that satisfy it. The theorem is mentioned in the 
Baudhayana Sulba-sutra of India, which was written between 
800 and 400 BCE. Nevertheless, the theorem came to be 
credited to Pythagoras. It is also proposition number 47 from 
Book | of Euclid’s Elements. 

According to the Syrian historian lamblichus (c. 250— 
330 CE), Pythagoras was introduced to mathematics 
by Thales of Miletus and his pupil Anaximander. In any 
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case, it is known that Pythagoras traveled to Egypt about 
535 BCE to further his study, was captured during an 
invasion in 525 BCE by Cambyses II of Persia and taken 
to Babylon, and may possibly have visited India before 
returning to the Mediterranean. Pythagoras soon settled 
in Croton (now Crotone, Italy) and set up a school, or in 
modern terms a monastery, where all members took strict 
vows of secrecy, and all new mathematical results for 
several centuries were attributed to his name. Thus, not 
only is the first proof of the theorem not known, there is also 
some doubt that Pythagoras himself actually proved the 
theorem that bears his name. It was probably independently 
discovered in several different cultures. 

Book | of the Elements ends with Euclid’s famous 
“windmill” proof of the Pythagorean theorem. Later in Book 
VI of the Elements, Euclid delivers an even easier demon- 
stration using the proposition that the areas of similar trian- 
gles are proportionate to the squares of their corresponding 
sides. Apparently, Euclid invented the windmill proof so that 
he could place the Pythagorean theorem as the capstone 
to Book |. He had not yet demonstrated (as he would in 
Book V) that line lengths can be manipulated in proportions 
as if they were commensurable numbers (integers or ratios 
of integers). 

A great many different proofs and extensions of the 
Pythagorean theorem have been invented. Taking extensions 
first, Euclid himself showed in a theorem praised in antiquity 
that any symmetrical regular figures drawn on the sides of a 
right triangle satisfy the Pythagorean relationship: the figure 
drawn on the hypotenuse has an area equal to the sum of the 
areas of the figures drawn on the legs. The semicircles that 
define Hippocrates of Chios’s lunes are examples of such 
an extension. 


(Continued on the next page) 


THE History OF MATHEMATICS 
(Continued from the previous page) 

In the Nine Chapters on the Mathematical Proce- 
dures (or Nine Chapters), compiled in the 1st century CE in 
China, several problems are given, along with their solu- 
tions, that involve finding the length of one of the sides of a 
right triangle when given the other two sides. In the Com- 
mentary of Liu Hui, from the 3rd century, Liu Hui offered a 
proof of the Pythagorean theorem that called for cutting up 
the squares on the legs of the right triangle and rearrang- 
ing them (“tangram style”) to correspond to the square on 
the hypotenuse. 

The Pythagorean theorem has fascinated people for 
nearly 4,000 years; there are now an estimated 367 different 
proofs, including ones by the Greek mathematician Pappus 
of Alexandria (flourished c. 320 CE), the Arab mathemati- 
cian-physician Thabit ibn Qurrah (c. 836-901), the Italian art- 
ist-inventor Leonardo da Vinci (1452-1519), and even U.S. 
President James Garfield (1831-1881). 


their solution procedures. Their use of sequential pro- 
cedures rather than formulas, however, is less likely to 
detract from an evaluation of their effort now that algo- 
rithmic methods much like theirs have become common- 
place through the development of computers. 

As mentioned above, the Babylonian scribes knew 
that the base (5), height (h), and diagonal (d) of a rectangle 
satisfy the relation b +h =d If one selects values at ran- 
dom for two of the terms, the third will usually be irratio- 
nal, but it is possible to find cases in which all three terms 
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are integers: for example, 3, 4, 5 and 5, 12, 13. (Such solu- 
tions are sometimes called Pythagorean triples.) A tablet 
in the Columbia University Collection presents a list of 15 
such triples (decimal equivalents are shown in parentheses 
at the right; the gaps in the expressions for h, b, and d sep- 
arate the place values in the sexagesimal numerals): 


ia) C 
1 53 2 49 (20 119 


169) 
355} 


36 56 7 1 20 45 (3,456 3,367 4,825) 

1 20 1 16 41 1 50 4¢ i J 4,601 6,649) 
45 331 49 5 1 al J 12,709 18,541) 

1 12 1 5 1 37 f2, AS oF} 

1 30 56 1 46 (90 56 106) 


(The entries in the column for hf have to be 
computed from the values for b and d, for they do 
not appear on the tablet; but they must once have 
existed on a portion now missing.) The ordering of 
the lines becomes clear from another column, listing 
the values of d/h" (brackets indicate figures that are 
lost or illegible), which form a continually decreasing 
sequence: [1 59 0] 15, [1 56 56] 58 14 50 6 15.,..., [1] 
23 13 46 40. Accordingly, the angle formed between 
the diagonal and the base in this sequence increases 
continually from just over 45° to just under 60°. 
Other properties of the sequence suggest that the 
scribe knew the general procedure for finding all 
such number triples—that for any integers p and gq, 
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2dlh = plq + q/p and 2b/h = p/q — q/p. (In the table the 
implied values p and g turn out to be regular numbers 
falling in the standard set of reciprocals, as mentioned 
earlier in connection with the multiplication tables.) 
Scholars are still debating nuances of the construction 
and the intended use of this table, but no one questions 
the high level of expertise implied by it. 


MATHEMATICAL ASTRONOMY 


The sexagesimal method developed by the Babylonians 
has a far greater computational potential than what was 
actually needed for the older problem texts. With the 
development of mathematical astronomy in the Seleu- 
cid period, however, it became indispensable. Astrono- 
mers sought to predict future occurrences of important 
phenomena, such as lunar eclipses and critical points in 
planetary cycles (conjunctions, oppositions, stationary 
points, and first and last visibility). They devised a tech- 
nique for computing these positions (expressed in terms 
of degrees of latitude and longitude, measured relative 
to the path of the Sun’s apparent annual motion) by suc- 
cessively adding appropriate terms in arithmetic progres- 
sion. The results were then organized into a table listing 
positions as far ahead as the scribe chose. (Although the 
method is purely arithmetic, one can interpret it graphi- 
cally: the tabulated values form a linear “zigzag” approx- 
imation to what is actually a sinusoidal variation.) While 
observations extending over centuries are required for 
finding the necessary parameters (e.g., periods, angular 
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range between maximum and minimum values, and the 
like), only the computational apparatus at their disposal 
made the astronomers’ forecasting effort possible. 

Within a relatively short time (perhaps a century or 
less), the elements of this system came into the hands of 
the Greeks. Although Hipparchus (2nd century BCE) 
favoured the geometric approach of his Greek predeces- 
sors, he took over parameters from the Mesopotamians and 
adopted their sexagesimal style of computation. Through 
the Greeks it passed to Arab scientists during the Middle 
Ages and thence to Europe, where it remained prominent 
in mathematical astronomy during the Renaissance and the 
early modern period. To this day it persists in the use of 
minutes and seconds to measure time and angles. 

Aspects of the Old Babylonian mathematics may 
have come to the Greeks even earlier, perhaps in the 5th 
century BCE, the formative period of Greek geometry. 
There are a number of parallels that scholars have noted: 
for example, the Greek technique of “application of 
area” corresponded to the Babylonian quadratic meth- 
ods (although in a geometric, not arithmetic, form). Fur- 
ther, the Babylonian rule for estimating square roots was 
widely used in Greek geometric computations, and there 
may also have been some shared nuances of technical 
terminology. Although details of the timing and man- 
ner of such a transmission are obscure because of the 
absence of explicit documentation, it seems that Western 
mathematics, while stemming largely from the Greeks, is 
considerably indebted to the older Mesopotamians. 


THE History OF MATHEMATICS 
TT = = 


MATHEMATICS IN ANCIENT EGypT 


The introduction of writing in Egypt in the predynastic 
period (c. 3000 BCE) brought with it the formation of 
a special class of literate professionals, the scribes. By 
virtue of their writing skills, the scribes took on all the 
duties of a civil service: record keeping, tax accounting, 
the management of public works (building projects and 
the like), even the prosecution of war through overseeing 
military supplies and payrolls. Young men enrolled in 
scribal schools to learn the essentials of the trade, which 
included not only reading and writing but also the basics 
of mathematics. 

One of the texts popular as a copy exercise in the 
schools of the New Kingdom (13th century BCE) was 
a satiric letter in which one scribe, Hori, taunts his rival, 
Amen-em-opet, for his incompetence as an adviser and 
manager. “You are the clever scribe at the head of the 
troops,” Hori chides at one point; 


a ramp is to be built, 730 cubits long, 55 cubits wide, 
with 120 compartments—it is 60 cubits high, 30 cubits 
in the middle...and the generals and the scribes turn 
to you and say, “ You are a clever scribe, your name is 
famous. Is there anything you don’t know? Answer us, 
how many bricks are needed?” Let each compartment 
be 30 cubits by 7 cubits. 


This problem, and three others like it in the same 
letter, cannot be solved without further data. But the 
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point of the humour 1s clear, as Hori challenges his rival 
with these hard, but typical, tasks. 

What is known of Egyptian mathematics tallies 
well with the tests posed by the scribe Hori. The infor- 
mation comes primarily from two long papyrus doc- 
uments that once served as textbooks within scribal 
schools. The Rhind papyrus (in the British Museum) 
is a copy made in the 17th century BCE of a text 
two centuries older still. In it is found a long table 
of fractional parts to help with division, followed 
by the solutions of 84 specific problems in arithme- 
tic and geometry. The Golenishchev papyrus (in the 
Moscow Museum of Fine Arts), dating from the 19th 
century BCE, presents 25 problems of a similar type. 
These problems reflect well the functions the scribes 
would perform, for they deal with how to distribute 
beer and bread as wages, for example, and how to 
measure the areas of fields as well as the volumes of 
pyramids and other solids. 


THE NUMERAL SYSTEM AND ARITHMETIC OPERATIONS 


The Egyptians, like the Romans after them, expressed 
numbers according to a decimal scheme, using separate 
symbols for 1, 10, 100, 1,000, and so on; each symbol 
appeared in the expression for a number as many times as 
the value it represented occurred in the number itself. For 
example, MMIII stood for 24. This rather cumbersome 
notation was used within the hieroglyphic writing found 
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in stone inscriptions and other formal texts, but in the 
papyrus documents the scribes employed a more conve- 
nient abbreviated script, called hieratic writing, where, 
for example, 24 was written ‘-. 

In such a system, addition and subtraction amount 
to counting how many symbols of each kind there are 
in the numerical expressions and then rewriting with the 
resulting number of symbols. The texts that survive do 
not reveal what, if any, special procedures the scribes 
used to assist in this. But for multiplication they intro- 
duced a method of successive doubling. For example, to 
multiply 28 by 11, one constructs a table of multiples of 
28 like the following: 


The several entries in the first column that together 
sum to 11 (1.e., 8, 2, and 1) are checked off. The product 
is then found by adding up the multiples corresponding to 
these entries; thus, 224 + 56 + 28 = 308, the desired product. 

To divide 308 by 28, the Egyptians applied the same 
procedure in reverse. Using the same table as in the mul- 
tiplication problem, one can see that 8 produces the larg- 
est multiple of 28 that is less then 308 (for the entry at 16 
is already 448), and 8 is checked off. The process is then 
repeated, this time for the remainder (84) obtained by sub- 
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Ancient Egyptian hieroglyphic numeral system 


1 10 1,000 10,000 100,000 1,000,000 
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Egyptian hieratic numerals (mathematical papyrus, c. 1600 Bc) 
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Because ancient Egyptians did not have a positional 
system, they needed separate symbols for each power of 


10. Bottom: Heiratic numerals. 


tracting the entry at 8 (224) from the original number (308). 
This, however, is already smaller than the entry at 4, which 
consequently is ignored, but it is greater than the entry at 
2 (56), which is then checked off. The process is repeated 
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again for the remainder obtained by subtracting 56 from 
the previous remainder of 84, or 28, which also happens to 
exactly equal the entry at 1 and which 1s then checked off. 
The entries that have been checked off are added up, yield- 
ing the quotient: 8 + 2 + 1 = 11. (In most cases, of course, 
there is a remainder that is less than the divisor.) 

For larger numbers this procedure can be improved 
by considering multiples of one of the factors by 10, 20,... 
or even by higher orders of magnitude (100, 1,000,...), 
as necessary (in the Egyptian decimal notation, these 
multiples are easy to work out). Thus, one can find the 
product of 28 by 27 by setting out the multiples of 28 
by 1, 2, 4, 8, 10, and 20. Since the entries 1, 2, 4, and 20 
add up to 27, one has only to add up the corresponding 
multiples to find the answer. 

Computations involving fractions are carried out 
under the restriction to unit parts (that is, fractions 
that in modern notation are written with 1 as the 
numerator). To express the result of dividing 4 by 7, 
for instance, which in modern notation is simply 4/7, 
the scribe wrote 1/2 + 1/14. The procedure for finding 
quotients in this form merely extends the usual method 
for the division of integers, where one now inspects the 
entries for 2/3, 1/3, 1/6, etc., and 1/2, 1/4, 1/8, etc., until 
the corresponding multiples of the divisor sum to the 
dividend. (The scribes included 2/3, one may observe, 
even though it is not a unit fraction.) In practice the 
procedure can sometimes become quite complicated 
(for example, the value for 2/29 is given in the Rhind 
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papyrus as 1/24 + 1/58 + 1/174 + 1/232) and can be 
worked out in different ways (for example, the same 
2/29 might be found as 1/15 + 1/435 or as 1/16 + 1/232 
+ 1/464, etc.). A considerable portion of the papyrus 
texts is devoted to tables to facilitate the finding of such 
unit-fraction values. 

These elementary operations are all that one needs 
for solving the arithmetic problems in the papyri. For 
example, “to divide 6 loaves among 10 men” (Rhind 
papyrus, problem 3), one merely divides to get the 
answer 1/2 + 1/10. In one group of problems an inter- 
esting trick is used: “A quantity (aha) and its 7th 
together make 19—what is it?” (Rhind papyrus, prob- 
lem 24). Here one first supposes the quantity to be 7: 
since ign of it becomes 8, not 19, one takes 19/8 (that 
is, 2 + 1/4 + 1/8), and its multiple by 7 (16 + 1/2 + 1/8) 
becomes the required answer. This type of procedure 
(sometimes called the method of “false position” or 
“false assumption’) is familiar in many other arithme- 
tic traditions (e.g., the Chinese, Hindu, Muslim, and 
Renaissance European), although they appear to have 
no direct link to the Egyptian. 


GEOMETRY 


The geometric problems in the papyri seek measure- 
ments of figures, like rectangles and triangles of given 
base and height, by means of suitable arithmetic oper- 
ations. In a more complicated problem, a rectangle is 
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sought whose area is 12 and whose height is 1/2 + 1/4 
times its base (Golenishchev papyrus, problem 6). To 
solve the problem, the ratio is inverted and multiplied 
by the area, yielding 16; the square root of the result (4) 
is the base of the rectangle, and 1/2 + 1/4 times 4, or 3, 
is the height. The entire process is analogous to the pro- 
cess of solving the algebraic equation for the problem 
(x Xx 7 ,x = 12), though without the use of a letter for the 
unknown. An interesting procedure is used to find the 
area of the circle (Rhind papyrus, problem 50): 1/9 of 
the diameter is discarded, and the result 1s squared. For 
example, if the diameter is 9, the area 1s set equal to 64. 
The scribe recognized that the area of a circle is pro- 
portional to the square of the diameter and assumed 
for the constant of proportionality (that is, 1/4) the 
value 64/81. This is a rather good estimate, being about 
0.6 percent too large. (It is not as close, however, as the 
now common estimate of 31, first proposed by Archi- 
medes, which is only about 0.04 percent too large.) But 
there is nothing in the papyri indicating that the scribes 
were aware that this rule was only approximate rather 
than exact. 

A remarkable result is the rule for the volume of 
the truncated pyramid (Golenishchev papyrus, prob- 
lem 14). The scribe assumes the height to be 6, the base 
to be a square of side 4, and the top a square of side 
2. He multiplies one-third the height times 28, finding 
the volume to be 56; here 28 is computed from 2 x 2 + 
2x 4+ 4x 4. Since this 1s correct, it can be assumed 
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that the scribe also knew the general rule: A = (h/3) 
(a + abr b). How the scribes actually derived the rule 
is a matter for debate, but it is reasonable to suppose 
that they were aware of related rules, such as that for 
the volume of a pyramid: one-third the height times the 
area of the base. 

The Egyptians employed the equivalent of similar 
triangles to measure distances. For instance, the seked of 
a pyramid is stated as the number of palms in the hori- 
zontal corresponding to arise of one cubit (seven palms). 
Thus, if the seked 1 is 5 1, and the base is 140 cubits, the 
height becomes 93 L eabits (Rhind papyrus, problem 57). 
The Greek sage Thales of Miletus (6th century BCE) 1s 
said to have measured the height of pyramids by means 
of their shadows (the report derives from Hieronymus, a 
disciple of Aristotle in the 4th century BCE). In light of 
the seked computations, however, this report must indi- 
cate an aspect of Egyptian surveying that extended back 
at least 1,000 years before the time of Thales. 


ASSESSMENT OF EGYPTIAN MATHEMATICS 


The papyri thus bear witness to a mathematical tradition 
closely tied to the practical accounting and surveying 
activities of the scribes. Occasionally, the scribes loos- 
ened up a bit: one problem (Rhind papyrus, problem 79), 
for example, seeks the total from seven houses, seven cats 
per house, seven mice per cat, seven ears of wheat per 
mouse, and seven hekat of grain per ear (result: 19,607). 


|. ae 


Certainly the scribe’s interest in progressions (for which 
he appears to have a rule) goes beyond practical consid- 
erations. Other than this, however, Egyptian mathemat- 
ics falls firmly within the range of practice. 

Even allowing for the scantiness of the documen- 
tation that survives, the Egyptian achievement in math- 
ematics must be viewed as modest. Its most striking 
features are competence and continuity. The scribes man- 
aged to work out the basic arithmetic and geometry nec- 
essary for their official duties as civil managers, and their 
methods persisted with little evident change for at least 
a millennium, perhaps two. Indeed, when Egypt came 
under Greek domination in the Hellenistic period (from 
the 3rd century BCE onward), the older school meth- 


180 cubits 


250 cubits 


The Egyptians defined the seked as the ratio 
of the run to the rise, which is the reciprocal 


of the modern definition of the slope. 
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ods continued. Quite remarkably, the older unit-fraction 
methods are still prominent in Egyptian school papyri 
written in the demotic (Egyptian) and Greek languages 
as late as the 7th century CE, for example. 

To the extent that Egyptian mathematics left a 
legacy at all, it was through its impact on the emerging 
Greek mathematical tradition between the 6th and 4th 
centuries BCE. Because the documentation from this 
period is limited, the manner and significance of the 
influence can only be conjectured. But the report about 
Thales measuring the height of pyramids is only one of 
several such accounts of Greek intellectuals learning 
from Egyptians; Herodotus and Plato describe with 
approval Egyptian practices in the teaching and appli- 
cation of mathematics. This literary evidence has histor- 
ical support, since the Greeks maintained continuous 
trade and military operations in Egypt from the 7th cen- 
tury BCE onward. It is thus plausible that basic precedents 
for the Greeks’ earliest mathematical efforts—how they 
dealt with fractional parts or measured areas and volumes, 
or their use of ratios in connection with similar figures— 
came from the learning of the ancient Egyptian scribes. 


GREEK MATHEMATICS 


THE Pre-EuCcLIDEAN PERIOD 


The Greeks divided the field of mathematics into arith- 
metic (the study of “multitude,” or discrete quantity) and 
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geometry (that of “magnitude,” or continuous quantity) 
and considered both to have originated in practical activ- 
ities. Proclus, in his Commentary on Euclid, observes that 
geometry—literally, “measurement of land”—first arose 
in surveying practices among the ancient Egyptians, for 
the flooding of the Nile compelled them each year to 
redefine the boundaries of properties. Similarly, arith- 
metic started with the commerce and trade of Phoent- 
cian merchants. Although Proclus wrote quite late in the 
ancient period (in the Sth century CE), his account drew 
upon views proposed much earlier—by Herodotus (mid- 
5th century BCE), for example, and by Eudemus, a disci- 
ple of Aristotle (late 4th century BCE). 

However plausible, this view is difficult to check, for 
there is only meagre evidence of practical mathematics 
from the early Greek period (roughly, the 8th through 
the 4th century BCE). Inscriptions on stone, for example, 
reveal use of a numeral system the same in principle as 
the familiar Roman numerals. Herodotus seems to have 
known of the abacus as an aid for computation by both 
Greeks and Egyptians, and about a dozen stone speci- 
mens of Greek abaci survive from the 5th and 4th centu- 
ries BCE. In the surveying of new cities in the Greek col- 
onies of the 6th and Sth centuries, there was regular use 
of a standard length of 70 plethra (one plethron equals 
100 feet) as the diagonal of a square of side 50 plethra; 
in fact, the actual diagonal of the square is 502 plethra, 
so this was equivalent to using 7/5 (or 1.4) as an esti- 
mate for \2, which is now known to equal 1.414.... In 
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the 6th century BCE the engineer Eupalinus of Megara 
directed an aqueduct through a mountain on the island 
of Samos, and historians still debate how he did it. Ina 
further indication of the practical aspects of early Greek 
mathematics, Plato describes in his Laws how the Egyp- 
tians drilled their children in practical problems in arith- 
metic and geometry; he clearly considered this a model 
for the Greeks to imitate. 

Such hints about the nature of early Greek practical 
mathematics are confirmed in later sources—for exam- 
ple, in the arithmetic problems in papyrus texts from 
Ptolemaic Egypt (from the 3rd century BCE onward) 
and the geometric manuals by Heron of Alexandria (1st 
century CE). In its basic manner this Greek tradition 
was much like the earlier traditions in Egypt and Mes- 
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opotamia. Indeed, it is likely that the Greeks borrowed 
from such older sources to some extent. 

What was distinctive of the Greeks’ contribution 
to mathematics—and what in effect made them the 
creators of “mathematics,” as the term is usually under- 
stood—was its development as a theoretical discipline. 
This means two things: mathematical statements are 
general, and they are confirmed by proof. For example, 
the Mesopotamians had procedures for finding whole 
numbers a, b, and c for which atb=c (e.g., 3, 4, 5; 5, 
12, 13; or 119, 120, 169). From the Greeks came a proof 
of a general rule for finding all such sets of numbers 
(now called Pythagorean triples): if one takes any whole 
numbers p and q, both being even or both odd, then a = 
(p — q)/2, b = pg, andc=(p + q)/2. As Euclid proves 
in Book X of the Elements, numbers of this form satisfy 
the relation for Pythagorean triples. Further, the Meso- 
potamians appear to have understood that sets of such 
numbers a, b, and c form the sides of right triangles, but 
the Greeks proved this result (Euclid, in fact, proves it 
twice: in Elements, Book I, proposition 47, and in a more 
general form in Elements, Book VI, proposition 31), and 
these proofs occur in the context of a systematic presen- 
tation of the properties of plane geometric figures. 

The Elements, composed by Euclid of Alexandria 
about 300 BCE, was the pivotal contribution to theo- 
retical geometry, but the transition from practical to 
theoretical mathematics had occurred much earlier, 
sometime in the 5th century BCE. Initiated by men 
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like Pythagoras of Samos (late 6th century) and Hippo- 
crates of Chios (late 5th century), the theoretical form of 
geometry was advanced by others, most prominently the 
Pythagorean Archytas of Tarentum, Theaetetus of Ath- 
ens, and Eudoxus of Cnidus (4th century). Because the 
actual writings of these men do not survive, knowledge 
about their work depends on remarks made by later writ- 
ers. While even this limited evidence reveals how heavily 
Euclid depended on them, it does not set out clearly the 
motives behind their studies. 

It is thus a matter of debate how and why this the- 
oretical transition took place. A frequently cited factor 
is the discovery of irrational numbers. The early Pythag- 
oreans held that “all things are number.” This might be 
taken to mean that any geometric measure can be asso- 
ciated with some number (that is, some whole number 
or fraction; in modern terminology, rational number), 
for in Greek usage the term for number, arithmos, refers 
exclusively to whole numbers or, in some contexts, to 
ordinary fractions. This assumption is common enough 
in practice, as when the length of a given line is said to 
be so many feet plus a fractional part. However, it breaks 
down for the lines that form the side and diagonal of 
the square. (For example, if it is supposed that the ratio 
between the side and diagonal may be expressed as the 
ratio of two whole numbers, it can be shown that both 
of these numbers must be even. This is impossible, since 
every fraction may be expressed as a ratio of two whole 
numbers having no common factors.) Geometrically, 
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this means that there is no length that could serve as a 
unit of measure of both the side and diagonal; that is, 
the side and diagonal cannot each equal the same length 
multiplied by (different) whole numbers. Accordingly, 
the Greeks called such pairs of lengths “incommensura- 
ble.” (In modern terminology, unlike that of the Greeks, 
the term “number” is applied to such quantities as y2, 
but they are called irrational.) 

This result was already well known at the time of 
Plato and may well have been discovered within the 
school of Pythagoras in the 5th century BCE, as some late 
authorities like Pappus of Alexandria (4th century CE) 
maintain. In any case, by 400 BCE it was known that 
lines corresponding to 3, \5, and other square roots 
are incommensurable with a fixed unit length. The more 
general result, the geometric equivalent of the theorem 
that \p is irrational whenever p is not a rational square 
number, is associated with Plato’s friend Theaetetus. 
Both Theaetetus and Eudoxus contributed to the fur- 
ther study of irrationals, and their followers collected the 
results into a substantial theory, as represented by the 
115 propositions of Book X of the Elements. 

The discovery of irrationals must have affected the 
very nature of early mathematical research, for it made 
clear that arithmetic was insufficient for the purposes 
of geometry, despite the assumptions made in practical 
work. Further, once such seemingly obvious assumptions 
as the commensurability of all lines turned out to be in 
fact false, then in principle all mathematical assumptions 
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were rendered suspect. At the least it became necessary to 
justify carefully all claims made about mathematics. Even 
more basically, it became necessary to establish what a 
reasoning has to be like to qualify as a proof. Appar- 
ently, Hippocrates of Chios, in the Sth century BCE, 
and others soon after him had already begun the work 
of organizing geometric results into a systematic form 
in textbooks called “elements” (meaning “fundamental 
results” of geometry). These were to serve as sources for 
Euclid in his comprehensive textbook a century later. 

The early mathematicians were not an isolated 
group but part of a larger, intensely competitive intel- 
lectual environment of pre-Socratic thinkers in Ionia 
and Italy, as well as Sophists at Athens. By insisting that 
only permanent things could have real existence, the 
philosopher Parmenides (Sth century BCE) called into 
question the most basic claims about knowledge itself. 
In contrast, Heracleitus (c. 500 BCE) maintained that 
all permanence is an illusion, for the things that are per- 
ceived arise through a subtle balance of opposing ten- 
sions. What 1s meant by “knowledge” and “proof” thus 
came into debate. 

Mathematical issues were often drawn into these 
debates. For some, like the Pythagoreans (and, later, 
Plato), the certainty of mathematics was held as a 
model for reasoning in other areas, like politics and 
ethics. But for others mathematics seemed prone to con- 
tradiction. Zeno of Elea (Sth century BCE) posed par- 
adoxes about quantity and motion. In one such para- 
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dox it is assumed that a line can be bisected again and 
again without limit; if the division ultimately results 
in a set of points of zero length, then even infinitely 
many of them sum up only to zero, but, if it results 
in tiny line segments, then their sum will be infinite. In 
effect, the length of the given line must be both zero 
and infinite. In the 5th century BCE a solution of such 
paradoxes was attempted by Democritus and the atom- 
ists, philosophers who held that all material bodies 
are ultimately made up of invisibly small “atoms” (the 
Greek word atomon means “indivisible”). But in geom- 
etry such a view came into conflict with the existence of 
incommensurable lines, since the atoms would become 
the measuring units of all lines, even incommensurable 
ones. Democritus and the Sophist Protagoras puzzled 
over whether the tangent to a circle meets it at a point 
or a line. The Sophists Antiphon and Bryson (both 5th 
century BCE) considered how to compare the circle to 
polygons inscribed in it. 

The pre-Socratics thus revealed difficulties in specific 
assumptions about the infinitely many and the infinitely 
small and about the relation of geometry to physi- 
cal reality, as well as in more general conceptions like 
“existence” and “proof.” Philosophical questions such 
as these need not have affected the technical researches 
of mathematicians, but they did make them aware of 
difficulties that could bear on fundamental matters 
and so made them the more cautious in defining their 
subject matter. 
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Any such review of the possible effects of factors 
such as these is purely conjectural, since the sources are 
fragmentary and never make explicit how the mathema- 
ticlans responded to the issues that were raised. But it 
is the particular concern over fundamental assumptions 
and proofs that distinguishes Greek mathematics from 
the earlier traditions. Plausible factors behind this con- 
cern can be identified in the special circumstances of 
the early Greek tradition—its technical discoveries and 
its cultural environment—even if it is not possible to 
describe in detail how these changes took place. 


THE ELEMENTS 


The principal source for reconstructing pre-Euclidean 
mathematics is Euclid’s Elements, for the major part of 
its contents can be traced back to research from the 4th 
century BCE and in some cases even earlier. The first 
four books present constructions and proofs of plane 
geometric figures: Book I deals with the congruence of 
triangles, the properties of parallel lines, and the area 
relations of triangles and parallelograms; Book II estab- 
lishes equalities relating to squares, rectangles, and tri- 
angles; Book III covers basic properties of circles; and 
Book IV sets out constructions of polygons in circles. 
Much of the content of Books I-III was already famil- 
iar to Hippocrates, and the material of Book IV can be 
associated with the Pythagoreans, so that this portion 
of the Elements has roots in 5th-century research. It is 
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known, however, that questions about parallels were 
debated in Aristotle’s school (c. 350 BCE), and so it may 
be assumed that efforts to prove results—such as the 
theorem stating that, for any given line and given point, 
there always exists a unique line through that point and 
parallel to the line—were tried and failed. Thus, the deci- 
sion to found the theory of parallels on a postulate, as in 
Book I of the E/ements, must have been a relatively recent 
development in Euclid’s time. (The postulate would later 
become the subject of much study, and in modern times 
it led to the discovery of the so-called non-Euclidean 
geometries. ) 

Book V sets out a general theory of proportion— 
that is, a theory that does not require any restriction to 
commensurable magnitudes. This general theory derives 
from Eudoxus. On the basis of the theory, Book VI 
describes the properties of similar plane rectilinear fig- 
ures and so generalizes the congruence theory of Book 
I. It appears that the technique of similar figures was 
already known in the 5th century BCE, even though a 
fully valid justification could not have been given before 
Eudoxus worked out his theory of proportion. 

Books VII-IX deal with what the Greeks called 
“arithmetic,” the theory of whole numbers. It includes 
the properties of numerical proportions, greatest com- 
mon divisors, least common multiples, and relative 
primes (Book VII); propositions on numerical progres- 
sions and square and cube numbers (Book VIII); and 
special results, like unique factorization into primes, the 
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existence of an unlimited number of primes, and the 
formation of “perfect numbers”—that is, those numbers 
that equal the sum of their proper divisors (Book IX). In 
some form Book VII stems from Theaetetus and Book 
VIUI from Archytas. 

Book X presents a theory of irrational lines and 
derives from the work of Theaetetus and Eudoxus. The 
remaining books treat the geometry of solids. Book XI 
sets out results on solid figures analogous to those for 
planes in Books I and VI; Book XII proves theorems on 
the ratios of circles, the ratios of spheres, and the vol- 
umes of pyramids and cones; Book XIII shows how to 
inscribe the five regular solids—known as the Platonic 
solids—in a given sphere (compare the constructions of 
plane figures in Book IV). The measurement of curved 
figures in Book XII is inferred from that of rectilinear 
figures; for a particular curved figure, a sequence of rec- 
tilinear figures is considered in which succeeding figures 
in the sequence become continually closer to the curved 
figure; the particular method used by Euclid derives from 
Eudoxus. The solid constructions in Book XIII derive 
from Theaetetus. 

In sum the Elements gathered together the whole 
field of elementary geometry and arithmetic that had 
developed in the two centuries before Euclid. Doubt- 
less, Euclid must be credited with particular aspects of 
this work, certainly with its editing as a comprehensive 
whole. But it is not possible to identify for certain even 
a single one of its results as having been his discovery. 
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Other, more advanced fields, though not touched on in 
the Elements, were already being vigorously studied in 
Euclid’s time, in some cases by Euclid himself. For these 
fields his textbook, true to its name, provides the appro- 
priate “elementary” introduction. 

One such field is the study of geometric construc- 
tions. Euclid, like geometers in the generation before 
him, divided mathematical propositions into two kinds: 
“theorems” and “problems.” A theorem makes the claim 
that all terms of a certain description have a specified 
property; a problem seeks the construction of a term 
that is to have a specified property. In the Elements all the 
problems are constructible on the basis of three stated 
postulates: that a line can be constructed by joining two 
given points, that a given line segment can be extended 
in a line indefinitely, and that a circle can be constructed 
with a given point as centre and a given line segment as 
radius. These postulates in effect restricted the construc- 
tions to the use of the so-called Euclidean tools—1.e., a 
compass and a straightedge or unmarked ruler. 


THE THREE CLASSICAL PROBLEMS 


Although Euclid solves more than 100 construction 
problems in the Elements, many more were posed whose 
solutions required more than just compass and straight- 
edge. Three such problems stimulated so much interest 
among later geometers that they have come to be known 
as the “classical problems”: doubling the cube (1.¢., con- 
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structing a cube whose volume is twice that of a given 
cube), trisecting the angle, and squaring the circle. Even in 
the pre-Euclidean period the effort to construct a square 
equal in area to a given circle had begun. Some related 
results came from Hippocrates; others were reported 
from Antiphon and Bryson; and Euclid’s theorem on the 
circle in Elements, Book XII, proposition 2, which states 
that circles are in the ratio of the squares of their diam- 
eters, was important for this search. But the first actual 
constructions (not, it must be noted, by means of the 
Euclidean tools, for this 1s impossible) came only in the 
3rd century BCE. The early history of angle trisection 1s 
obscure. Presumably, it was attempted in the pre-Euclid- 
ean period, although solutions are known only from the 
3rd century or later. 

There are several successful efforts at doubling the 
cube that date from the pre-Euclidean period, how- 
ever. Hippocrates showed that the problem could be 
reduced to that of finding two mean proportionals: if for 
a given line a it is necessary to find x such that x = 2a’, 
lines x and y may be sought such that a:x = x-y = y:2a; for 
then a/x = (alx) = (alx)(x/y)(y/2a) = al2a = 1/2. (Note 
that the same argument holds for any multiplier, not just 
the number 2.) Thus, the cube can be doubled if it is pos- 
sible to find the two mean proportionals x and y between 
the two given lines a and 2a. Constructions of the problem 
of the two means were proposed by Archytas, Eudoxus, 
and Menaechmus in the 4th century BCE. Menaechmus, 
for example, constructed three curves corresponding to 
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these same proportions: x = ay, y = 2ax, and xy = 2a’: 
the intersection of any two of them then produces the 
line x that solves the problem. Menaechmus’s curves are 
conic sections: the first two are parabolas, the third a 
hyperbola. Thus, it is often claimed that Menaechmus 
originated the study of the conic sections. Indeed, Pro- 
clus and his older authority, Geminus (mid-Ist cen- 
tury CE), appear to have held this view. The evidence 
does not indicate how Menaechmus actually conceived 


Menaechmus’s solution to the problem of 
doubling the volume of a cube 
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of the curves, however, so it is possible that the formal 
study of the conic sections as such did not begin until 
later, near the time of Euclid. Both Euclid and an older 
contemporary, Aristaeus, composed treatments (now 
lost) of the theory of conic sections. 

In seeking the solutions of problems, geometers devel- 
oped a special technique, which they called “analysis.” 
They assumed the problem to have been solved and then, 
by investigating the properties of this solution, worked 
back to find an equivalent problem that could be solved 
on the basis of the givens. To obtain the formally correct 
solution of the original problem, then, geometers reversed 
the procedure: first the data were used to solve the equiva- 
lent problem derived in the analysis, and, from the solution 
obtained, the original problem was then solved. In contrast 
to analysis, this reversed procedure is called “synthesis.” 

Menaechmus’s cube duplication is an example of 
analysis: he assumed the mean proportionals x and y and 
then discovered them to be equivalent to the result of 
intersecting the three curves whose construction he could 
take as known. (The synthesis consists of introducing 
the curves, finding their intersection, and showing that 
this solves the problem.) It is clear that geometers of the 
4th century BCE were well acquainted with this method, 
but Euclid provides only syntheses, never analyses, of the 
problems solved in the Elements. Certainly in the cases 
of the more complicated constructions, however, there 
can be little doubt that some form of analysis preceded 
the syntheses presented in the Elements. 
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GEOMETRY IN THE 3RD CenTuRY BCE 


The Elements was one of several major efforts by Euclid 
and others to consolidate the advances made over the 
4th century BCE. On the basis of these advances, Greek 
geometry entered its golden age in the 3rd century. This 
was a period rich with geometric discoveries, particularly 
in the solution of problems by analysis and other meth- 
ods, and was dominated by the achievements of two fig- 
ures: Archimedes of Syracuse (early 3rd century BCE) 
and Apollonius of Perga (late 3rd century BCE). 


ARCHIMEDES 


Archimedes was most noted for his use of the Eudox- 
ean method of exhaustion in the measurement of curved 
surfaces and volumes and for his applications of geom- 
etry to mechanics. To him is owed the first appearance 
and proof of the approximation a1, for the ratio of 
the circumference to the diameter of the circle (what is 
now designated 7). Characteristically, Archimedes went 
beyond familiar notions, such as that of simple approx- 
imation, to more subtle insights, like the notion of 
bounds. For example, he showed that the perimeters of 
regular polygons circumscribed about the circle eventu- 
ally become less than 3'/, the diameter as the number of 
their sides increases (Archimedes established the result 
for 96-sided polygons); similarly, the perimeters of the 
inscribed polygons eventually become greater than 3"). 


ANCIENT WESTERN MATHEMATICS 
te 


Thus, these two values are upper and lower bounds, 
respectively, of 1. 

Archimedes’ result bears on the problem of cir- 
cle quadrature in the light of another theorem he proved: 
that the area of a circle equals the area of a triangle 
whose height equals the radius of the circle and whose 
base equals its circumference. He established analogous 
results for the sphere showing that the volume of a sphere 
is equal to that of a cone whose height equals the radius 
of the sphere and whose base equals its surface area; 
the surface area of the sphere he found to be four times 
the area of its greatest circle. Equivalently, the volume 
of a sphere is shown to be two-thirds that of the cylin- 
der which just contains it (that is, having height and diam- 
eter equal to the diameter of the sphere), while its surface 
is also equal to two-thirds that of the same cylinder (that 
is, if the circles that enclose the cylinder at top and bot- 
tom are included). The Greek historian Plutarch (early 
2nd century CE) relates that Archimedes requested the 
figure for this theorem to be engraved on his tombstone, 
which is confirmed by the Roman writer Cicero (1st cen- 
tury BCE), who actually located the tomb in 75 BCE, 
when he was quaestor of Sicily. 


APOLLONIUS 


The work of Apollonius of Perga extended the field of 
geometric constructions far beyond the range in the Ele- 
ments. For example, Euclid in Book III shows how to 
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draw a circle so as to pass through three given points 
or to be tangent to three given lines; Apollonius (in a 
work called Tangencies, which no longer survives) found 
the circle tangent to three given circles, or tangent to 
any combination of three points, lines, and circles. (The 
three-circle tangency construction, one of the most 
extensively studied geometric problems, has attracted 
more than 100 different solutions in the modern period.) 

Apollonius is best known for his Conics, a treatise 
in eight books (Books I-IV survive in Greek, V—VII in a 
medieval Arabic translation; Book VIII is lost). The conic 
sections are the curves formed when a plane intersects the 
surface of a cone (or 
double cone); it is 
assumed that the sur- 
face of the cone is gen- 
erated by the rotation 
of a line through a fixed 
point around the cir- 
cumference of a circle 
which is in a plane not 
containing that point. 
(The fixed point is the 
vertex of the cone, and 
the rotated line its gen- 
erator.) There are three 
basic types: if the cut- 
ting plane 1s parallel to 
Sphere with circumscribing cylinder one of the positions of 
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the generator, 1t produces a parabola; if 1t meets the cone 
only on one side of the vertex, it produces an ellipse (of 
which the circle is a special case); but, if it meets both 
parts of the cone, it produces a hyperbola. Apollonius sets 
out in detail the properties of these curves. He shows, for 
example, that for given line segments a and b the parab- 
ola corresponds to the relation (in modern notation) 
= = ax, the ellipse to = = ax — ax /b, and the hyperbola 
to y= = ax + ax Ib. 

Apollonius’s treatise on conics in part consolidated 
more than a century of work before him and in part pre- 
sented new findings of his own. As mentioned earlier, 
Euclid had already issued a textbook on the conics, while 
even earlier Menaechmus had played a role in their study. 
The names that Apollonius chose for the curves (the terms 
may be original with him) indicate yet an earlier connec- 
tion. In the pre-Euclidean geometry parabolé referred to 
a specific operation, the “application” of a given area to a 
given line, in which the line x 1s sought such that ax = b 
(where a and 5 are given lines); alternatively, x may be 
sought such that x(a + x) = b, or x(a -x) = b, and in 
these cases the application is said to be in “excess” (hyper- 
bole) or “defect” (elleipsis) by the amount of a square fig- 
ure (namely, x). These constructions, which amount to 
a geometric solution of the general quadratic, appear in 
Books I, II, and VI of the Elements and can be associated 
in some form with the 5th-century Pythagoreans. 

Apollonius presented a comprehensive survey of 
the properties of these curves. A sample of the topics 
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hyperbola 


The conic sections result from 


intersecting a plane with a double 


cone. 


he covered includes 
the following: the 
relations satisfied 
by the diameters 
and tangents of 
conics (Book J); 
how hyperbolas 
are related to their 
“asymptotes,” the 
lines they approach 
without ever meet- 
ing (Book II); how 
to draw tangents to 
given conics (Book 
II); relations of 
chords intersecting 
in conics (Book HI); 
the determination of 
the number of ways 
in which conics may 
intersect (Book IV); 
how to draw “nor- 
mal” lines to conics 


(that is, lines meeting them at right angles; Book V); and 
the congruence and similarity of conics (Book VI). 

By Apollonius’s explicit statement, his results are of 
principal use as methods for the solution of geometric 
problems via conics. While he actually solved only a lim- 
ited set of problems, the solutions of many others can be 
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inferred from his theorems. For instance, the theorems of 
Book HI permit the determination of conics that pass 
through given points or are tangent to given lines. In 
another work (now lost) Apollonius solved the prob- 
lem of cube duplication by conics (a solution related 
in some way to that given by Menaechmus); further, 
a solution of the problem of angle trisection given by 
Pappus may have come from Apollonius or been influ- 
enced by his work. 

With the advance of the field of geometric prob- 
lems by Euclid, Apollonius, and their followers, it 
became appropriate to introduce a classifying scheme: 
those problems solvable by means of conics were 
called solid, while those solvable by means of cir- 
cles and lines only (as assumed in Euclid’s Elements) 
were called planar. Thus, one can double the square 
by planar means (as in Elements, Book II, proposi- 
tion 14), but one cannot double the cube in such a 
way, although a solid construction is possible (as 
given above). Similarly, the bisection of any angle is 
a planar construction (as shown in Elements, Book I, 
proposition 9), but the general trisection of the angle 
is of the solid type. It is not known when the classifica- 
tion was first introduced or when the planar methods 
were assigned canonical status relative to the others, 
but it seems plausible to date this near Apollonius’s 
time. Indeed, much of his work—books like the Jan- 
gencies, the Vergings (or Inclinations), and the Plane 
Loci, now lost but amply described by Pappus—turns 
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on the project of setting out the domain of planar 
constructions in relation to solutions by other means. 
On the basis of the principles of Greek geometry, it 
cannot be demonstrated, however, that it is impossible 
to effect by planar means certain solid constructions 
(like the cube duplication and angle trisection). These 
results were established only by algebraists in the 19th 
century (notably by the French mathematician Pierre 
Laurent Wantzel in 1837). 

A third class of problems, called linear, embraced 
those solvable by means of curves other than the circle 
and the conics (in Greek the word for “line,” grammé, 
refers to all lines, whether curved or straight). For 
instance, one group of curves, the conchoids (from the 
Greek word for “shell”), are formed by marking off a 
certain length on a ruler and then pivoting it about a 
fixed point in such a way that one of the marked points 
stays on a given line; the other marked point traces out a 
conchoid. These curves can be used wherever a solution 
involves the positioning of a marked ruler relative to a 
given line (in Greek such constructions are called neu- 
ses, or “vergings” of a line to a given point). For exam- 
ple, any acute angle (figured as the angle between one 
side and the diagonal of a rectangle) can be trisected by 
taking a length equal to twice the diagonal and moving 
it about until 1t comes to be inserted between two other 
sides of the rectangle. If instead the appropriate con- 
choid relative to either of those sides is introduced, the 
required position of the line can be determined without 
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Conchoid curve 


the trial and error of a moving ruler. Because the same 
construction can be effected by means of a hyperbola, 
however, the problem is not linear but solid. Such uses of 
the conchoids were presented by Nicomedes (middle or 
late 3rd century BCE), and their replacement by equiva- 
lent solid constructions appears to have come soon after, 
perhaps by Apollonius or his associates. 

Some of the curves used for problem solving are not 
so reducible. For example, the Archimedean spiral cou- 
ples uniform motion of a point on a half ray with uni- 
form rotation of the ray around a fixed point at its end. 
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Such curves have their principal interest as means for 
squaring the circle and trisecting the angle. 


ApPLiED GEOMETRY 


A major activity among geometers in the 3rd cen- 
tury BCE was the development of geometric approaches 
in the study of the physical sciences—specifi- 
cally, optics, mechanics, and astronomy. In each case the 
aim was to formulate the basic concepts and principles 
in terms of geometric and numerical quantities and then 
to derive the fundamental phenomena of the field by 
geometric constructions and proofs. 


Angle trisection using a conchoid 
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Angle trisection using a hyperbola 


In optics, Euclid’s textbook (called the Optics) set 
the precedent. Euclid postulated visual rays to be straight 
lines, and he defined the apparent size of an object in 
terms of the angle formed by the rays drawn from the 
top and the bottom of the object to the observer’s eye. 
He then proved, for example, that nearer objects appear 
larger and appear to move faster and showed how to 
measure the height of distant objects from their shadows 
or reflected images and so on. Other textbooks set out 
theorems on the phenomena of reflection and refraction 
(the field called catoptrics). The most extensive survey of 
optical phenomena is a treatise attributed to the astrono- 
mer Ptolemy (2nd century CE), which survives only in the 
form of an incomplete Latin translation (12th century) 
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TRISECTING THE ANGLE: THE 
QUADRATRIX OF HIPPIAS 


Hippias of Elis (fl. 5th century BCE) imagined a mechanical 
device to divide arbitrary angles into various proportions. 
His device depends on a curve, now known as the quadra- 
trix of Hippias, that is produced by plotting the intersection 
of two moving line segments. Starting from a horizontal 
position, one segment is rotated at a constant rate through 
a right angle around one of its endpoints, while the second 
segment glides uniformly through a vertical distance equal 
to the first segment’s length. Because both the angle rota- 
tion and the vertical displacement are produced by uniform 
motion, each moves through the same fraction of its entire 
journey in the same time. Hence, finding some proportion 
(say one-third) for a given angle (here 2COA) is simple: find 
the equal proportion for vertical displacement of the point 
on the quadratrix at which the two segments intersect (C), 
locate the point (F) on the quadratrix at that height (one- 
third of the original height), and then draw the new angle 
(ZFOA) through that point. 


based on a lost Arabic translation. It covers the fields of 
geometric optics and catoptrics, as well as experimental 
areas, such as binocular vision, and more general phil- 
osophical principles (the nature of light, vision, and 
colour). Of a somewhat different sort are the studies 
of burning mirrors by Diocles (late 2nd century BCE), 
who proved that the surface that reflects the rays from 
the Sun to a single point is a paraboloid of revolution. 


ANCIENT WESTERN MATHEMATICS 
i 


Constructions of such devices remained of interest as 
late as the 6th century CE, when Anthemius of Tralles, 
best known for his work as architect of Hagia Sophia at 
Constantinople, compiled a survey of remarkable mirror 
configurations. 

Mechanics was dominated by the work of Archi- 
medes, who was the first to prove the principle of bal- 
ance: that two weights are in equilibrium when they are 
inversely proportional to their distances from the ful- 
crum. From this principle he developed a theory of the 
centres of gravity of plane and solid figures. He was also 
the first to state and prove the principle of buoyancy— 
that floating bodies displace their equal in weight—and to 
use it for proving the conditions of stability of segments 
of spheres and paraboloids, solids formed by rotating a 
parabolic segment about its axis. Archimedes proved the 
conditions under which these solids will return to their 
initial position if tipped, in particular for the positions 
now called “stable I” and “stable I,” where the vertex 
faces up and down, respectively. 

In his work Method Concerning Mechanical Theorems, 
Archimedes also set out a special “mechanical method” 
that he used for the discovery of results on volumes and 
centres of gravity. He employed the bold notion of consti- 
tuting solids from the plane figures formed as their sections 
(e.g., the circles that are the plane sections of spheres, cones, 
cylinders, and other solids of revolution), assigning to such 
figures a weight proportional to their area. For example, 
to measure the volume of a sphere, he imagined a balance 
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beam, one of whose arms 1s a diameter of the sphere with 
the fulcrum at one endpoint of this diameter and the other 
arm an extension of the diameter to the other side of the 
fulcrum by a length equal to the diameter. 

Archimedes showed that the three circular cross sec- 
tions made by a plane cutting the sphere and the associated 
cone and cylinder will be in balance (the circle in the cylin- 
der with the circles in the sphere and cone) if the circle in 
the cylinder is kept in its original place while the circles in 
the sphere and cone are placed with their centres of gravity 
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This figure shows part of the hyperbolic paraboloid 


at the opposite end of the balance. Doing this for all the sets 
of circles formed as cross sections of these solids by planes, 
he concluded that the solids themselves are in balance— 
the cylinder with the sphere and the cone together—if the 
cylinder is left where it is, while the sphere and cone are 
placed with their centres of gravity at the opposite end of 
the balance. Since the centre of gravity of the cylinder is the 
midpoint of its axis, it follows that (sphere + cone):cylinder 
= |:2 (by the inverse proportion of weights and distances). 
Since the volume of the cone is one-third that of the cyl- 
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ellipsoid cease inder, however, 
the volume of the 
sphere is found to 
be one-sixth that 
of the cylinder. 
In similar man- 
ner, Archimedes 
worked out the 
volumes and cen- 
An ellipsoid is a closed surface tres of gravity 
such that its intersection with any of spherical seg- 
plane will produce an ellipse or a 
circle. 


ments and segments 
of the solids of 
revolution of conic 
sections—paraboloids, ellipsoids, and hyperboloids. The 
critical notions—constituting solids out of their plane sec- 
tions and assigning weights to geometric figures—were not 
formally valid within the standard conceptions of Greek 
geometry, and Archimedes admitted this. But he main- 
tained that, although his arguments were not “demon- 
strations” (1.e., proofs), they had value for the discovery of 
results about these figures. 

The geometric study of astronomy has pre-Eu- 
clidean roots, Eudoxus having developed a model for 
planetary motions around a stationary Earth. Accepting 
the principle—which, according to Eudemus, was first 
proposed by Plato—that only combinations of uniform 
circular motions are to be used, Eudoxus represented the 
path of a planet as the result of superimposing rotations 
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of three or more concentric spheres whose axes are set 
at different angles. Although the fit with the phenomena 
was unsatisfactory, the curves thus generated (the hippo- 
pede, or “horse-fetter”) continued to be of interest for 
their geometric properties, as is known through remarks 
by Proclus. Later geometers continued the search for geo- 
metric patterns satisfying the Platonic conditions. The 
simplest model, a scheme of circular orbits centred on 
the Sun, was introduced by Aristarchus of Samos (3rd 
century BCE), but this was rejected by others, since a 
moving Earth was judged to be impossible on physical 
grounds. But Aristarchus’s scheme could have suggested 
use of an “eccentric” model, in which the planets rotate 
about the Sun and the Sun in turn rotates about the 
Earth. Apollonius introduced an alternative “epicyclic” 
model, in which the planet turns about a point that 
itself orbits in a circle (the “deferent’”’) centred at or near 
the Earth. As Apollonius knew, his epicyclic model is 
geometrically equivalent to an eccentric. These models 
were well adapted for explaining other phenomena of 
planetary motion. For instance, if the Earth is displaced 
from the centre of a circular orbit (as in the eccentric 
scheme), the orbiting body will appear to vary in speed 
(appearing faster when nearer the observer, slower when 
farther away), as is in fact observed for the Sun, Moon, 
and planets. By varying the relative sizes and rotation 
rates of the epicycle and deferent, in combination with 
the eccentric, a flexible device may be obtained for repre- 
senting planetary motion. 
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apogee 


perigee 


In Ptolemy's geocentric model of the universe, the Sun, 
the Moon, and each planet orbit a stationary Earth. 


LATER TRENDS IN GEOMETRY AND ARITHMETIC 


GREEK TRIGONOMETRY AND MENSURATION 


After the 3rd century BCE, mathematical research 
shifted increasingly away from the pure forms of con- 
structive geometry toward areas related to the applied 
disciplines, in particular to astronomy. The necessary 
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theorems on the geometry of the sphere (called spher- 
ics) were compiled into textbooks, such as the one by 
Theodosius (3rd or 2nd century BCE) that consolidated 
the earlier work by Euclid and the work of Autolycus of 
Pitane (flourished c. 300 BCE) on spherical astronomy. 
More significant, in the 2nd century BCE the Greeks first 
came into contact with the fully developed Mesopota- 
mian astronomical systems and took from them many of 
their observations and parameters (for example, values 
for the average periods of astronomical phenomena). 
While retaining their own commitment to geometric 
models rather than adopting the arithmetic schemes of 
the Mesopotamians, the Greeks nevertheless followed 
the Mesopotamians’ lead in seeking a predictive astron- 
omy based on a combination of mathematical theory 
and observational parameters. They thus made it their 
goal not merely to describe but to calculate the angular 
positions of the planets on the basis of the numerical 
and geometric content of the theory. This major restruc- 
turing of Greek astronomy, in both its theoretical and 
practical respects, was primarily due to Hipparchus (2nd 
century BCE), whose work was consolidated and further 
advanced by Ptolemy. 

To facilitate their astronomical researches, the 
Greeks developed techniques for the numerical mea- 
surement of angles, a precursor of trigonometry, and 
produced tables suitable for practical computation. 
Early efforts to measure the numerical ratios in trian- 
gles were made by Archimedes and Aristarchus. Their 
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results were soon extended, and comprehensive treatises 
on the measurement of chords (in effect, a construc- 
tion of a table of values equivalent to the trigonometric 
sine) were produced by Hipparchus and by Menelaus of 
Alexandria (1st century CE). These works are now lost, 
but the essential theorems and tables are preserved in 
Ptolemy’s A/magest (Book I, chapter 10). For comput- 
ing with angles, the Greeks adopted the Mesopotamian 
sexagesimal method in arithmetic, whence it survives 
in the standard units for angles and time employed to 
this day. 


NUMBER THEORY 


Although Euclid handed down a precedent for number 
theory in Books VIU-IX of the Elements, later writers 
made no further effort to extend the field of theoreti- 
cal arithmetic in his demonstrative manner. Beginning 
with Nicomachus of Gerasa (flourished c. CE 100), 
several writers produced collections expounding a much 
simpler form of number theory. A favourite result is the 
representation of arithmetic progressions in the form of 
“polygonal numbers.” For instance, if the numbers 1, 2, 
3, 4,...are added successively, the “triangular” numbers 
1, 3, 6, 10,...are obtained; similarly, the odd numbers 
1, 3, 5, 7,...sum to the “square” numbers 1, 4, 9, 16,..., 
while the sequence 1, 4, 7, 10,..., with a constant differ- 
ence of 3, sums to the “pentagonal” numbers 1, 5, 12, 
22,.... In general, these results can be expressed in the 
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form of geometric shapes formed by lining up dots in 
the appropriate two-dimensional configurations. In the 
ancient arithmetics such results are invariably presented 
as particular cases, without any general notational 
method or general proof. The writers in this tradition are 
called neo-Pythagoreans, since they viewed themselves 
as continuing the Pythagorean school of the 5th cen- 
tury BCE, and, in the spirit of ancient Pythagoreanism, 
they tied their numerical interests to a philosophical the- 
ory that was an amalgam of Platonic metaphysical and 
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The ancient Greeks generally thought of 
numbers in concrete terms, particularly as 
measurements and geometric dimensions. 
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theological doctrines. With its exponent Iamblichus of 
Chalcis (4th century CE), neo-Pythagoreans became a 
prominent part of the revival of pagan religion in oppo- 
sition to Christianity in late antiquity. 

An interesting concept of this school of thought, 
which Iamblichus attributes to Pythagoras himself, is 
that of “amicable numbers”: two numbers are amicable 
if each is equal to the sum of the proper divisors of the 
other (for example, 220 and 284). Attributing virtues 
such as friendship and justice to numbers was character- 
istic of the Pythagoreans at all times. 

Of much greater mathematical significance is the 
arithmetic work of Diophantus of Alexandria (c. 3rd 
century CE). His writing, the Arithmetica, originally in 
13 books (six survive in Greek, another four in medie- 
val Arabic translation), sets out hundreds of arithmetic 
problems with their solutions. For example, Book II, 
problem 8, seeks to express a given square number as 
the sum of two square numbers (here and throughout, 
the “numbers” are rational). Like those of the neo-Py- 
thagoreans, his treatments are always of particular cases 
rather than general solutions; thus, in this problem the 
given number is taken to be 16, and the solutions worked 
out are 256/25 and 144/25. In this example, as is often 
the case, the solutions are not unique; indeed, in the very 
next problem Diophantus shows how a number given as 
the sum of two squares (e.g., 13 = 4 + 9) can be expressed 
differently as the sum of two other squares (for example, 
13 = 324/25 + 1/25). 
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To find his solutions, Diophantus adopted an arith- 
metic form of the method of analysis. He first refor- 
mulated the problem in terms of one of the unknowns, 
and he then manipulated it as if it were known until an 
explicit value for the unknown emerged. He even adopted 
an abbreviated notational scheme to facilitate such oper- 
ations, where, for example, the unknown is symbolized 
by a figure somewhat resembling the Roman letter S. 
(This is a standard abbreviation for the word number in 
ancient Greek manuscripts.) Thus, in the first problem 
discussed above, if S is one of the unknown solutions, 
then 16-S’ isa square; supposing the other unknown 
to be 2S — 4 (where the 2 is arbitrary but the 4 chosen 
because it is the square root of the given number 16), 
Diophantus found from summing the two unknowns 
([2S — 4] and S’) that 45° — 16S + 16+ S = 16, or 5S = 
16S; that is, S = 16/5. So one solution is S = 250/25, 
while the other solution is 16 — S” , or 144/25. 


SURVIVAL AND INFLUENCE OF GREEK MATHEMATICS 


Notable in the closing phase of Greek mathematics 
were Pappus (early 4th century CE), Theon (late 4th 
century), and Theon’s daughter Hypatia. All were active 
in Alexandria as professors of mathematics and astron- 
omy, and they produced extensive commentaries on 
the major authorities—Pappus and Theon on Ptolemy, 
Hypatia on Diophantus and Apollonius. Later, Eutocius 
of Ascalon (early 6th century) produced commentaries 
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on Archimedes and Apollonius. While much of their 
output has since been lost, much survives. They proved 
themselves reasonably competent in technical matters 
but little inclined toward significant insights (their aim 
was usually to fill in minor steps assumed in the proofs, 
to append alternative proofs, and the like), and their level 
of originality was very low. But these scholars frequently 
preserved fragments of older works that are now lost, 
and their teaching and editorial efforts assured the sur- 
vival of the works of Euclid, Archimedes, Apollonius, 
Diophantus, Ptolemy, and others that now do exist, 
either in Greek manuscripts or in medieval translations 
(Arabic, Hebrew, and Latin) derived from them. 

The legacy of Greek mathematics, particularly in the 
fields of geometry and geometric science, was enormous. 
From an early period the Greeks formulated the objectives 
of mathematics not in terms of practical procedures but 
as a theoretical discipline committed to the development 
of general propositions and formal demonstrations. The 
range and diversity of their findings, especially those of 
the masters of the 3rd century BCE, supplied geometers 
with subject matter for centuries thereafter, even though 
the tradition that was transmitted into the Middle Ages 
and Renaissance was incomplete and defective. 

The rapid rise of mathematics in the 17th cen- 
tury was based in part on the conscious imitation of the 
ancient classics and on competition with them. In the 
geometric mechanics of Galileo and the infinitesimal 
researches of Johannes Kepler and Bonaventura Cavalieri, 
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it is possible to perceive a direct inspiration from Archime- 
des. The study of the advanced geometry of Apollonius 
and Pappus stimulated new approaches in geometry—for 
example, the analytic methods of René Descartes and the 
projective theory of Girard Desargues. Purists like Chris- 
tiaan Huygens and Isaac Newton insisted on the Greek 
geometric style as a model of rigour, just as others sought 
to escape its forbidding demands of completely worked-out 
proofs. The full impact of Diophantus’s work is evident par- 
ticularly with Pierre de Fermat in his researches in algebra 
and number theory. Although mathematics has today gone 
far beyond the ancient achievements, the leading figures of 
antiquity, like Archimedes, Apollonius, and Ptolemy, can 
still be rewarding reading for the ingenuity of their insights. 


MATHEMATICS IN THE ISLAMIC 
Wor pb (8TH-15TH CENTURY) 


ORIGINS 


In Hellenistic times and in late antiquity, scientific learn- 
ing in the eastern part of the Roman world was spread 
over a variety of centres, and Justinian’s closing of the 
pagan academies in Athens in 529 gave further impetus 
to this diffusion. An additional factor was the trans- 
lation and study of Greek scientific and philosophical 
texts sponsored both by monastic centres of the various 
Christian churches in the Levant, Egypt, and Mesopota- 
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mia and by enlightened rulers of the Sasanian dynasty in 
places like the medical school at Gondeshapur. 

Also important were developments in India in 
the first few centuries CE. Although the decimal sys- 
tem for whole numbers was apparently not known to 
the Indian astronomer Aryabhata (born 476), it was 
used by his pupil Bhaskara I in 620, and by 670 the 
system had reached northern Mesopotamia, where the 
Nestorian bishop Severus Sebokht praised its Hindu 
inventors as discoverers of things more ingenious than 
those of the Greeks. Earlier, in the late 4th or early Sth 
century, the anonymous Hindu author of an astronom- 
ical handbook, the Surya Siddhanta, had tabulated the 
sine function (unknown in Greece) for every 37 , of arc 
from 3'/,° to 90°. 

Within this intellectual context the rapid expan- 
sion of Islam took place between the time of Mu 
hammad’s return to Mecca in 630 from his exile in 
Medina and the Muslim conquest of lands extend- 
ing from Spain to the borders of China by 715. Not 
long afterward, Muslims began the acquisition of for- 
eign learning, and, by the time of the caliph al-Man 
sir (died 775), such Indian and Persian astronom- 
ical material as the Brahma-sphuta-siddhanta and 
the Shah’s Tables had been translated into Arabic. 
The subsequent acquisition of Greek material was 
greatly advanced when the caliph al-Ma’min con- 
structed a translation and research centre, the House 
of Wisdom, in Baghdad during his reign (813-833). 
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This map spans more than 600 years of prominent 
Islamic mathematicians. 


Most of the translations were done from Greek and 
Syriac by Christian scholars, but the impetus and 
support for this activity came from Muslim patrons. 
These included not only the caliph but also wealthy 
individuals such as the three brothers known as the 
Bani Misa, whose treatises on geometry and mechan- 
ics formed an important part of the works studied in 
the Islamic world. 

Of Euclid’s works the Elements, the Data, 
the Optics, the Phaenomena, and On Divisions were 
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translated. Of Archimedes’ works only two—Sphere and 
Cylinder and Measurement of the Circle—are known to 
have been translated, but these were sufficient to stim- 
ulate independent researches from the 9th to the 15th 
century. On the other hand, virtually all of Apollonius’s 
works were translated, and of Diophantus and Mene- 
laus one book each, the Arithmetica and the Sphaerica, 
respectively, were translated into Arabic. Finally, the 
translation of Ptolemy’s A/magest furnished important 
astronomical material. 

Of the minor writings, Diocles’ treatise on mirrors, 
Theodosius’s Spherics, Pappus’s work on mechanics, 
Ptolemy’s Planisphaerium, and Hypsicles’ treatises on 
regular polyhedra (the so-called Books XIV and XV of 
Euclid’s Elements) were among those translated. 


MATHEMATICS IN THE 9TH CENTURY 


Thabit ibn Qurrah (836-901), a Sabian from Harran in 
northern Mesopotamia, was an important translator 
and reviser of these Greek works. In addition to trans- 
lating works of the major Greek mathematicians (for the 
Bant Misa, among others), he was a court physician. He 
also translated Nicomachus of Gerasa’s Arithmetic and 
discovered a beautiful rule for finding amicable numbers, 
a pair of numbers such that each number is the sum of 
the set of proper divisors of the other number. The inves- 
tigation of such numbers formed a continuing tradition 
in Islam. Kamal al-Din al-Farisi (died c. 1320) gave the 
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pair 17,926 and 18,416 as an example of Thabit’s rule, 
and in the 17th century Muhammad Baqir Yazdi gave 
the pair 9,363,584 and 9,437,056. 

One scientist who was typical of the 9th century 
was Muhammad ibn Misa al-Khwarizmi. Working in 
the House of Wisdom, he introduced Indian material 
in his astronomical works and also wrote an early book 
explaining Hindu arithmetic, the Book of Addition 
and Subtraction According to the Hindu Calculation. In 
another work, the Book of Restoring and Balancing, he 
provided a systematic introduction to algebra, including 
a theory of quadratic equations. Both works had import- 
ant consequences for Islamic mathematics. Hindu Calcu- 
lation began a tradition of arithmetic books that, by the 
middle of the next century, led to the invention of decimal 
fractions (complete with a decimal point), and Restoring 
and Balancing became the point of departure and model 
for later writers such as the Egyptian Abi Kamil. Both 
books were translated into Latin, and Restoring and Bal- 
ancing was the origin of the word algebra, from the Arabic 
word for “restoring” in its title (a/-jabr). The Hindu Calcu- 
lation, from a Latin form of the author’s name, a/gorismi, 
yielded the word algorithm. 

Al-Khwarizmi’s algebra also served as a model 
for later writers in its application of arithmetic and 
algebra to the distribution of inheritances according 
to the complex requirements of Muslim religious law. 
This tradition of service to the Islamic faith was an 
enduring feature of mathematical work in Islam and 
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one that, in the eyes of many, justified the study of sec- 
ular learning. In the same category are al-Khwarizmi’s 
method of calculating the time of visibility of the 
new moon (which signals the beginning of the Mus- 
lim month) and the expositions by astronomers of 
methods for finding the direction to Mecca for the five 
daily prayers. 


MATHEMATICS IN THE 10TH CENTURY 


Islamic scientists in the 10th century were involved in 
three major mathematical projects: the completion of 
arithmetic algorithms, the development of algebra, and 
the extension of geometry. 

The first of these projects led to the appearance 
of three complete numeration systems, one of which 
was the finger arithmetic used by the scribes and trea- 
sury Officials. This ancient arithmetic system, which 
became known throughout the East and Europe, 
employed mental arithmetic and a system of storing 
intermediate results on the fingers as an aid to mem- 
ory. (Its use of unit fractions recalls the Egyptian 
system.) During the 10th and 11th centuries capable 
mathematicians, such as Abi’l-Wafa (940-997/998), 
wrote on this system, but it was eventually replaced by 
the decimal system. 

A second common system was the base-60 numera- 
tion inherited from the Babylonians via the Greeks and 
known as the arithmetic of the astronomers. Although 
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astronomers used this system for their tables, they usu- 
ally converted numbers to the decimal system for com- 
plicated calculations and then converted the answer back 
to sexagesimals. 

The third system was Indian arithmetic, whose 
basic numeral forms, complete with the zero, eastern 
Islam took over from the Hindus. (Different forms 
of the numerals, whose origins are not entirely clear, 
were used in western Islam.) The basic algorithms also 
came from India, but these were adapted by al-Uqlidist 
(c. 950) to pen and paper instead of the traditional 
dust board, a move that helped to popularize this sys- 
tem. Also, the arithmetic algorithms were completed 
in two ways: by the extension of root-extraction pro- 
cedures, known to Hindus and Greeks only for square 
and cube roots, to roots of higher degree and by the 
extension of the Hindu decimal system for whole 
numbers to include decimal fractions. These fractions 
appear simply as computational devices in the work 
of both al-Ugqlidisi and al-Baghdadi (c. 1000), but in 
subsequent centuries they received systematic treat- 
ment as a general method. As for extraction of roots, 
Abia’l-Wafa wrote a treatise (now lost) on the topic, 
and Omar Khayyam (1048-1131) solved the general 
problem of extracting roots of any desired degree. 
Omar’s treatise too is lost, but the method is known 
from other writers, and it appears that a major step in 
its development was al-Karaji’s 10th-century deriva- 
tion by means of mathematical induction of the bino- 
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mial theorem for whole-number exponents—i.e., his 
discovery that 
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During the 10th century Islamic algebraists pro- 
gressed from al-Khwéarizmi’s quadratic polynomials 
to the mastery of the algebra of expressions involving 
arbitrary positive or negative integral powers of the 
unknown. Several algebraists explicitly stressed the anal- 
ogy between the rules for working with powers of the 
unknown in algebra and those for working with powers 
of 10 in arithmetic, and there was interaction between 
the development of arithmetic and algebra from the 10th 
to the 12th century. A 12th-century student of al-Kara- 
ji’s works, al-Samaw«al, was able to approximate the 
quotient (20x + 30x)/(6x + 12) as 
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and also gave a rule for finding the coefficients of the succes- 
sive powers of 1/x. Although none of this employed sym- 
bolic algebra, algebraic symbolism was in use by the 14th 
century in the western part of the Islamic world. The 
context for this well-developed symbolism was, it seems, 
commentaries that were destined for teaching purposes, 
such as that of Ibn Qunftidh (1330-1407) of Algeria on 
the algebra of Ibn al-Bann& (1256-1321) of Morocco. 
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Other parts of algebra developed as well. Both 
Greeks and Hindus had studied indeterminate equations, 
and the translation of this material and the application 
of the newly developed algebra led to the investigation 
of Diophantine equations by writers like Abi Kamil, 
al-Karaji, and Abi Ja‘far al-Khazin (first half of 10th 
century), as well as to attempts to prove a special case of 
what is now known as Fermat’s last theorem—namely, 
that there are no rational solutions to x + y =z. The 
great scientist Ibn al-Haytham (965-1040) solved prob- 
lems involving congruences by what is now called Wilson’s 
theorem, which states that, if p is a prime, then p divides 
(p — 1) X (p — 2)---X 2 X 1 + 1, and al-Baghdadi gave 
a variant of the idea of amicable numbers by defining 
two numbers to “balance” if the sums of their divisors 
are equal. 

However, not only arithmetic and algebra 
but geometry too underwent extensive development. 
Thabit ibn Qurrah, his grandson Ibrahim ibn Sinan 
(909-946), Abi Sahl al-Kiht (died c. 995), and Ibn 
al-Haytham solved problems involving the pure geom- 
etry of conic sections, including the areas and volumes 
of plane and solid figures formed from them, and also 
investigated the optical properties of mirrors made 
from conic sections. Ibrahim ibn Sinan, Abu Sahl 
al-Kuihi, and Ibn al-Haytham used the ancient tech- 
nique of analysis to reduce the solution of problems 
to constructions involving conic sections. (Ibn al-Hay- 
tham, for example, used this method to find the point 
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on a convex spherical mirror at which a given object is 
seen by a given observer.) Thabit and Ibrahim showed 
how to design the curves needed for sundials. Abi’l- 
Wafa, whose book on the arithmetic of the scribes 
is mentioned above, also wrote on geometric methods 
needed by artisans. 

In addition, in the late 10th century 
Abi'l-Wafa and the prince Abi Nasr Mansur stated 
and proved theorems of plane and spherical geome- 
try that could be applied by astronomers and geogra- 
phers, including the laws of sines and tangents. Abi 
Nasr’s pupil al-Birini (973-1048), who produced a vast 
amount of high-quality work, was one of the masters 
in applying these theorems to astronomy and to such 
problems in mathematical geography as the determina- 
tion of latitudes and longitudes, the distances between 
cities, and the direction from one city to another. 


Omar KHayYAm 


The mathematician and poet Omar Khayyam was born 
in Neyshabir (in Iran) only a few years before al-Biriint’s 
death. He later lived in Samarkand and Esfahan, and 
his brilliant work there continued many of the main 
lines of development in 10th-century mathematics. Not 
only did he discover a general method of extracting 
roots of arbitrary high degree, but his A/gebra contains 
the first complete treatment of the solution of cubic 
equations. Omar did this by means of conic sections, but 
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he declared his hope that his successors would succeed 
where he had failed in finding an algebraic formula for 
the roots. 

Omar was also a part of an Islamic tradition, 
which included Thabit and Ibn al-Haytham, of inves- 
tigating Euclid’s parallel postulate. To this tradition 
Omar contributed the idea of a quadrilateral with two 
congruent sides perpendicular to the base. The paral- 
lel postulate would be proved, Omar recognized, if he 
could show that the remaining two angles were right 
angles. In this he failed, but his question about the 
quadrilateral became the standard way of discussing 
the parallel postulate. 
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\ Z\ABD is congruent to A BAC (because they have two 
sides and the included angle that are equal, respectively). \ 
Hence, AC=BD, so A\ADC is congruent to ABCD (the 
two triangles having three equal sides). Therefore, 
ZADC =ZBCD. | 


Omar Khayyam constructed the quadrilateral 
shown in this figure in an effort to prove that 


Euclid's fifth postulate, concerning parallel lines, 
is superfluous. 
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That postulate, however, was only one of the ques- 
tions on the foundations of mathematics that interested 
Islamic scientists. Another was the definition of ratios. 
Omar Khayyam, along with others before him, felt that 
the theory in Book V of Euclid’s Elements was logically 
satisfactory but intuitively unappealing, so he proved 
that a definition known to Aristotle was equivalent to 
that given in Euclid. In fact, Omar argued that ratios 
should be regarded as “ideal numbers,” and so he con- 
ceived of a much broader system of numbers than 
that used since Greek antiquity, that of the positive 
real numbers. 


IsLaMIC MATHEMATICS TO THE 15TH CENTURY 


In the 12th century the physician al-Samaw‘al continued 
and completed the work of al-Karaji in algebra and also 
provided a systematic treatment of decimal fractions as 
a means of approximating irrational quantities. In his 
method of finding roots of pure equations, x" = N, he 
used what is now known as Horner’s method to expand 
the binomial (a + y)’. His contemporary Sharaf al-Din 
al-Tist late in the 12th century provided a method of 
approximating the positive roots of arbitrary equations, 
based on an approach virtually identical to that dis- 
covered by Francois Viete in 16th-century France. The 
important step here was less the general idea than the 
development of the numerical algorithms necessary to 
effect it. 
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Sharaf al-Din was the discoverer of a device, 
called the linear astrolabe, that places him in another 
important Islamic mathematical tradition, one that 
centred on the design of new forms of the ancient 
astronomical instrument known as the astrolabe. The 
astrolabe, whose mathematical theory is based on the 
stereographic projection of the sphere, was invented 
in late antiquity, but its extensive development in 
Islam made it the pocket watch of the medievals. In its 
original form it required a different plate of horizon 
coordinates for each latitude, but in the 11th century 
the Spanish Muslim astronomer al-Zarqallu invented 
a single plate that worked for all latitudes. Slightly ear- 
lier, astronomers in the East had experimented with 
plane projections of the sphere, and al-Birini invented 
such a projection that could be used to produce a map 
of a hemisphere. The culminating masterpiece was 
the astrolabe of the Syrian Ibn al-Shatir (1305-75), 
a mathematical tool that could be used to solve all 
the standard problems of spherical astronomy in five 
different ways. 

On the other hand, Muslim astronomers had 
developed other methods for solving these problems 
using the highly accurate trigonometry tables and the 
new trigonometry theorems they had developed. Out 
of these developments came the creation of trigonom- 
etry as a mathematical discipline, separate from its 
astronomical applications, by Nasir al-Din al-Tisi at 
his observatory in Maragheh in the 13th century. (It 
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was there too that al-Tisi’s pupil Qutb al-Din al-Shirazi 
[1236-1311] and his pupil Kamal al-Din Farisi, using 
Ibn al-Haytham’s great work, the Optics, were able to 
give the first mathematically satisfactory explanation 
of the rainbow.) 

Al-Tisi’s observatory was supported by a grand- 
son of Genghis Khan, Hulegii, who sacked Baghdad 
in 1258. Ulaigh Beg, the grandson of the Mongol con- 
queror Timur, founded an observatory at Samarkand 
in the early years of the 15th century. Uligh Beg was 
himself a good astronomer, and his tables of sines and 
tangents for every minute of arc (accurate to five sexa- 
gesimal places) were one of the great achievements in 
numerical mathematics up to his time. He was also the 
patron of Jamshid al-Kashi (died 1429), whose work The 
Reckoners’ Key summarizes most of the arithmetic of 
his time and includes sections on algebra and practical 
geometry as well. Among al-Kasht’s works is a masterful 
computation of the value of 2m, which, when expressed 
in decimal fractions, is accurate to 16 places, as well 
as the application of a numerical method, now known 
as fixed-point iteration, for solving the cubic equation 
with sin 1° as a root. His work was indeed of a quality 
deserving Uligh Beg’s description as “known among the 
famous of the world.” 

Al-Kashi lived almost five centuries after the first 
translations of Arabic material into Latin, and by his 
time the Islamic mathematical tradition had given the 
West not only its first versions of many of the Greek 
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classics but also a complete set of algorithms for Hin- 
du-Arabic arithmetic, plane and spherical trigonometry, 
and the powerful tool of algebra. Although mathemat- 
ical inquiry continued in Islam in the centuries after 
al-Kashi’s time, the mathematical centre of gravity was 
shifting to the West. That this was so 1s, of course, in no 
small measure due to what the Western mathematicians 
had learned from their Islamic predecessors during the 
preceding centuries. 


Chapter 3 


EUROPEAN 


MATHEMATICS SINCE 
THE MIDDLE AGES 


from the fallow period that had begun with the 

fall of the Roman Empire. Such mathematicians 
as Leonardo of Pisa and Nicholas Oresme were able to 
apply and extend the knowledge that had been preserved 
in the Islamic world. 


B eginning in the Middle Ages, mathematics emerged 


Until the 11th century only a small part of the Greek 
mathematical corpus was known in the West. Because 
almost no one could read Greek, what little was available 
came from the poor texts written in Latin in the Roman 
Empire, together with the very few Latin translations 
of Greek works. Of these the most important were the 
treatises by Boethius, who about CE 500 made Latin 
redactions of a number of Greek scientific and logical 
writings. His Arithmetic, which was based on Nicoma- 
chus, was well known and was the means by which med1- 
eval scholars learned of Pythagorean number theory. 
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Boethius and Cassiodorus provided the material for the 
part of the monastic education called the quadrivium: 
arithmetic, geometry, astronomy, and music theory. 
Together with the trivium (grammar, logic, rhetoric), 
these subjects formed the seven liberal arts, which were 
taught in the monasteries, cathedral schools, and, from 
the 12th century on, universities and which constituted 
the principal university instruction until modern times. 

For monastic life it sufficed to know how to calcu- 
late with Roman numerals. The principal application 
of arithmetic was a method for determining the date 
of Easter, the computus, that was based on the lunar 
cycle of 19 solar years (i.e., 235 lunar revolutions) and 
the 28-year solar cycle. Between the time of Bede (died 
735), when the system was fully developed, and about 
1500, the computus was reduced to a series of verses 
that were learned by rote. Until the 12th century, geom- 
etry was largely concerned with approximate formulas 
for measuring areas and volumes in the tradition of the 
Roman surveyors. About CE 1000 the French scholar 
Gerbert of Aurillac, later Pope Sylvester I, introduced 
a type of abacus in which numbers were represented by 
stones bearing Arabic numerals. Such novelties were 
known to very few. 


THE TRANSMISSION OF GREEK AND ARABIC LEARNING 


In the 11th century a new phase of mathematics began 
with the translations from Arabic. Scholars through- 
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out Europe went to Toledo, Cordoba, and elsewhere 
in Spain to translate into Latin the accumulated learn- 
ing of the Muslims. Along with philosophy, astron- 
omy, astrology, and medicine, important mathemat- 
ical achievements of the Greek, Indian, and Islamic 
civilizations became available in the West. Particularly 
important were Euclid’s Elements, the works of Archi- 
medes, and al-Khwarizmi’s treatises on arithmetic and 
algebra. Western texts called a/gorismus (a Latin form of 
the name al-Khwarizmi) introduced the Hindu-Arabic 
numerals and applied them in calculations. Thus, mod- 
ern numerals first came into use in universities and then 
became common among merchants and other laymen. It 
should be noted that, up to the 15th century, calculations 
were often performed with board and counters. Reckon- 
ing with Hindu-Arabic numerals was used by merchants 
at least from the time of Leonardo of Pisa (beginning of 
the 13th century), first in Italy and then in the trading 
cities of southern Germany and France, where maestri 
dabbaco or Rechenmeister taught commercial arithme- 
tic in the various vernaculars. Some schools were private, 
while others were run by the community. 


THE UNIVERSITIES 


Mathematics was studied from a theoretical standpoint 
in the universities. The Universities of Paris and Oxford, 
which were founded relatively early (c. 1200), were centres 
for mathematics and philosophy. Of particular impor- 
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tance in these universities were the Arabic-based versions 
of Euclid, of which there were at least four by the 12th 
century. Of the numerous redactions and compendia 
which were made, that of Johannes Campanus (c. 1250; 
first printed in 1482) was easily the most popular, serv- 
ing as a textbook for many generations. Such redactions 
of the Elements were made to help students not only to 
understand Euclid’s textbook but also to handle other, 
particularly philosophical, questions suggested by pas- 
sages in Aristotle. The ratio theory of the Elements pro- 
vided a means of expressing the various relations of the 
quantities associated with moving bodies, relations that 
now would be expressed by formulas. Also in Euclid were 
to be found methods of analyzing infinity and continu- 
ity (paradoxically, because Euclid always avoided infinity). 

Studies of such questions led not only to new 
results but also to a new approach to what is now called 
physics. Thomas Bradwardine, who was active in Mer- 
ton College, Oxford, in the first half of the 14th century, 
was one of the first medieval scholars to ask whether 
the continuum can be divided infinitely or whether there 
are smallest parts (indivisibles). Among other topics, 
he compared different geometric shapes in terms of the 
multitude of points that were assumed to compose them, 
and from such an approach paradoxes were generated 
that were not to be solved for centuries. Another fertile 
question stemming from Euclid concerned the angle 
between a circle and a line tangent to it (called the horn 
angle): if this angle is not zero, a contradiction quickly 
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ensues, but, if it is zero, then, by definition, there can be 
no angle. For the relation of force, resistance, and the 
speed of the body moved by this force, Bradwardine sug- 
gested an exponential law. Nicholas Oresme (died 1382) 
extended Bradwardine’s ideas to fractional exponents. 
Another question having to do with the quantifica- 
tion of qualities, the so-called latitude of forms, began 
to be discussed at about this time in Paris and in Merton 
College. Various Aristotelian qualities (e.g., heat, density, 
and velocity) were assigned an intensity and extension, 
which were sometimes represented by the height and 
bases (respectively) of a geometric figure. The area of 
the figure was then considered to represent the quantity 
of the quality. In the important case in which the quality 
is the motion of a body, the intensity its speed, and the 
extension its time, the area of the figure was taken to 
represent the distance covered by the body. Uniformly 
accelerated motion starting at zero velocity gives rise to 


Uniformly accelerated motion 


EUROPEAN MATHEMATICS SINCE THE MIDDLE AGES 
a 


a triangular figure. It was proved by the Merton school 
that the quantity of motion in such a case is equal to 
the quantity of a uniform motion at the speed achieved 
halfway through the accelerated motion; in modern 
formulation, s = ‘Lat (Merton rule). Discussions like 
this certainly influenced Galileo indirectly and may 
have influenced the founding of coordinate geometry 
in the 17th century. Another important development 
in the scholastic “calculations” was the summation of 
infinite series. 

Basing his work on translated Greek sources, about 
1464 the German mathematician and astronomer Regio- 
montanus wrote the first book (printed in 1533) in the 
West on plane and spherical trigonometry independent of 
astronomy. He also published tables of sines and tangents 
that were in constant use for more than two centuries. 


THE RENAISSANCE 


Italian artists and merchants influenced the mathematics 
of the late Middle Ages and the Renaissance in several 
ways. In the 15th century a group of Tuscan artists, 
including Filippo Brunelleschi, Leon Battista Alberti, 
and Leonardo da Vinci, incorporated linear perspec- 
tive into their practice and teaching, about a century 
before the subject was formally treated by mathematicians. 
Italian maestri d’abbaco tried, albeit unsuccessfully, to 
solve nontrivial cubic equations. In fact, the first general 
solution was found by Scipione del Ferro at the begin- 
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ning of the 16th century and rediscovered by Niccolo 
Tartaglia several years later. The solution was published 
by Gerolamo Cardano in his Ars magna (Ars Magna or 
the Rules of Algebra) in 1545, together with Lodovico 
Ferrari’s solution of the quartic equation. 

By 1380 an algebraic symbolism had been developed 
in Italy in which letters were used for the unknown, for its 
square, and for constants. The symbols used today for the 
unknown (for example, x), the square root sign, and the 
signs + and — came into general use in southern Germany 
beginning about 1450. They were used by Regiomonta- 
nus and by Fridericus Gerhart and received an impetus 
about 1486 at the University of Leipzig from Johann 
Widman. The idea of distinguishing between known 
and unknown quantities in algebra was first consistently 
applied by Francois Viete, with vowels for unknown and 
consonants for known quantities. Viete found some rela- 
tions between the coefficients of an equation and its roots. 
This was suggestive of the idea, explicitly stated by Albert 
Girard in 1629 and proved by Carl Friedrich Gauss in 
1799, that an equation of degree n has n roots. Complex 
numbers, which are implicit in such ideas, were gradually 
accepted about the time of Rafael Bombelli (died 1572), 
who used them in connection with the cubic. 

Apollonius’s Conics and the investigations of areas 
(quadratures) and of volumes (cubatures) by Archime- 
des formed part of the humanistic learning of the 16th 
century. These studies strongly influenced the later devel- 
opments of analytic geometry, the infinitesimal calculus, 
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and the theory of functions, subjects that were developed 
in the 17th century. 


MATHEMATICS IN THE 17TH AND 
18TH CENTURIES 


The 17th century, the period of the scientific revolution, wit- 
nessed the consolidation of Copernican heliocentric astron- 
omy and the establishment of inertial physics in the work 
of Johannes Kepler, Galileo, René Descartes, and Isaac 
Newton. This period was also one of intense activity and 
innovation in mathematics. Advances in numerical calcu- 
lation, the development of symbolic algebra and analytic 
geometry, and the invention of the differential and integral 
calculus resulted in a major expansion of the subject areas 
of mathematics. By the end of the 17th century, a program 
of research based in analysis had replaced classical Greek 
geometry at the centre of advanced mathematics. In the next 
century this program would continue to develop in close 
association with physics, more particularly mechanics and 
theoretical astronomy. The extensive use of analytic meth- 
ods, the incorporation of applied subjects, and the adoption 
of a pragmatic attitude to questions of logical rigour distin- 
guished the new mathematics from traditional geometry. 


INSTITUTIONAL BACKGROUND 


Until the middle of the 17th century, mathematicians 
worked alone or in small groups, publishing their work in 
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books or communicating with other researchers by letter. 
Ata time when people were often slow to publish, “invis- 
ible colleges,” networks of scientists who corresponded 
privately, played an important role in coordinating and 
stimulating mathematical research. Marin Mersenne in 
Paris acted as a clearinghouse for new results, informing 
his many correspondents—including Pierre de Fermat, 
Descartes, Blaise Pascal, Gilles Personne de Roberval, 
and Galileo—of challenge problems and novel solutions. 
Later in the century John Collins, librarian of Lon- 
don’s Royal Society, performed a similar function among 
British mathematicians. 

In 1660 the Royal Society of London was founded, 
to be followed in 1666 by the French Academy of Sci- 
ences, in 1700 by the Berlin Academy, and in 1724 by 
the St. Petersburg Academy. The official publications 
sponsored by the academies, as well as independent 
journals such as the Acta Eruditorum (founded in 
1682), made possible the open and prompt communi- 
cation of research findings. Although universities in the 
17th century provided some support for mathematics, 
they became increasingly ineffective as state-supported 
academies assumed direction of advanced research. 


NumerIcAL CALCULATION 


The development of new methods of numerical calcula- 
tion was a response to the increased practical demands 
of numerical computation, particularly in trigonometry, 
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navigation, and astronomy. New ideas spread quickly 
across Europe and resulted by 1630 in a major revolu- 
tion in numerical practice. 

Simon Stevin of Holland, in his short pamphlet La 
Disme (1585), introduced decimal fractions to Europe 
and showed how to extend the principles of Hindu-Ar- 
abic arithmetic to calculation with these numbers. Ste- 
vin emphasized the utility of decimal arithmetic “for all 
accounts that are encountered in the affairs of men,” and 
he explained in an appendix how it could be applied to 
surveying, stereometry, astronomy, and mensuration. 
His idea was to extend the base-10 positional principle 
to numbers with fractional parts, with a corresponding 
extension of notation to cover these cases. In his system 
the number 237.578 was denoted 


237 0) 5 1) 7 @) 8 B), 


in which the digits to the left of the zero are the integral 
part of the number. To the right of the zero are the digits 
of the fractional part, with each digit succeeded by a cir- 
cled number that indicates the negative power to which 
10 is raised. Stevin showed how the usual arithmetic of 
whole numbers could be extended to decimal fractions, 
using rules that determined the positioning of the nega- 
tive powers of 10. 

In addition to its practical utility, La Disme was 
significant for the way it undermined the dominant style 
of classical Greek geometry in theoretical mathematics. 
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Stevin’s proposal required a rejection of the distinction 
in Euclidean geometry between magnitude, which is con- 
tinuous, and number, which is a multitude of indivisible 
units. For Euclid, unity, or one, was a special sort of thing, 
not number but the origin, or principle, of number. The 
introduction of decimal fractions seemed to imply that the 
unit could be subdivided and that arbitrary continuous 
magnitude could be represented numerically; it implicitly 
supposed the concept of a general positive real number. 

Tables of logarithms were first published in 1614 by 
the Scottish laird John Napier in his treatise Description 
of the Marvelous Canon of Logarithms. This work was 
followed (posthumously) five years later by another in 
which Napier set forth the principles used in the con- 
struction of his tables. The basic idea behind logarithms 
is that addition and subtraction are easier to perform 
than multiplication and division, which, as Napier 
observed, require a “tedious expenditure of time” and 
are subject to “slippery errors.” By the law of expo- 
nents, aa’ =a’ +’; that is, in the multiplication of num- 
bers, the exponents are related additively. By correlating 
the geometric sequence of numbers a, a: a. ..(a 1s called 
the base) and the arithmetic sequence 1, 2, 3,...and inter- 
polating to fractional values, it is possible to reduce the 
problem of multiplication and division to one of addi- 
tion and subtraction. To do this Napier chose a base that 
was very close to 1, differing from it by only 1/ 10’. The 
resulting geometric sequence therefore yielded a dense 
set of values, suitable for constructing a table. 
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In his work of 1619 Napier presented an inter- 
esting kinematic model to generate the geometric and 
arithmetic sequences used in the construction of his 
tables. Assume two particles move along separate lines 
from given initial points. The particles begin moving at 
the same instant with the same velocity. The first par- 
ticle continues to move with a speed that is decreasing, 
proportional at each instant to the distance remaining 
between it and some given fixed point on the line. The 
second particle moves with a constant speed equal to 
its initial velocity. Given any increment of time, the dis- 
tances traveled by the first particle in successive incre- 
ments form a geometrically decreasing sequence. The 
corresponding distances traveled by the second particle 
form an arithmetically increasing sequence. Napier was 
able to use this model to derive theorems yielding precise 
limits to approximate values in the two sequences. 

Napier’s kinematic model indicated how skilled 
mathematicians had become by the early 17th century in 
analyzing nonuniform motion. Kinematic ideas, which 
appeared frequently in mathematics of the period, pro- 
vided a clear and visualizable means for the generation 
of geometric magnitude. The conception of a curve 
traced by a particle moving through space later played a 
significant role in the development of the calculus. 

Napier’s ideas were taken up and revised by the 
English mathematician Henry Briggs, the first Savil- 
ian Professor of Geometry at Oxford. In 1624 Briggs 
published an extensive table of common logarithms, 
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or logarithms to the base 10. Because the base was no 
longer close to 1, the table could not be obtained as sim- 
ply as Napier’s, and Briggs therefore devised techniques 
involving the calculus of finite differences to facilitate 
calculation of the entries. He also devised interpolation 
procedures of great computational efficiency to obtain 
intermediate values. 

In Switzerland the instrument maker Joost 
Burgi arrived at the idea for logarithms independently 
of Napier, although he did not publish his results until 
1620. Four years later a table of logarithms prepared 
by Kepler appeared in Marburg. Both Burgi and Kepler 
were astronomical observers, and Kepler included loga- 
rithmic tables in his famous Tabulae Rudolphinae (1627; 
“Rudolphine Tables”), astronomical tabulations of 
planetary motion derived by using the assumption of 
elliptical orbits about the Sun. 


ANALYTIC GEOMETRY 


The invention of analytic geometry was, next to the dif- 
ferential and integral calculus, the most important math- 
ematical development of the 17th century. Originating 
in the work of the French mathematicians Viete, Fermat, 
and Descartes, it had by the middle of the century estab- 
lished itself as a major program of mathematical research. 

Two tendencies in contemporary mathematics 
stimulated the rise of analytic geometry. The first was 
an increased interest in curves, resulting in part from the 
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recovery and Latin translation of the classical treatises 
of Apollonius, Archimedes, and Pappus, and in part 
from the increasing importance of curves in such applied 
fields as astronomy, mechanics, optics, and stereometry. 
The second was the emergence a century earlier of an 
established algebraic practice in the work of the Italian 
and German algebraists and its subsequent shaping 
by Viete into a powerful mathematical tool at the end of 
the century. 

Viete was a prominent representative of the human- 
ist movement in mathematics that set itself the project of 
restoring and furthering the achievements of the Classical 
Greek geometers. In his Jn artem analyticem isagoge (1591; 
“Introduction to the Analytic Arts”), Viete, as part of his 
program of rediscovering the method of analysis used by 
the ancient Greek mathematicians, proposed new alge- 
braic methods that employed variables, constants, and 
equations, but he saw this as an advancement over the 
ancient method, a view he arrived at by comparing the 
geometric analysis contained in Book VII of Pappus’s Col- 
lection with the arithmetic analysis of Diophantus’s Arith- 
metica. Pappus had employed an analytic method for the 
discovery of theorems and the construction of problems; 
in analysis, by contrast to synthesis, one proceeds from 
what is sought until one arrives at something known. In 
approaching an arithmetic problem by laying down an 
equation among known and unknown magnitudes and 
then solving for the unknown, one was, Viete reasoned, 
following an “analytic” procedure. 
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Viéte introduced the concept of algebraic variable, 
which he denoted using a capital vowel (4, E, I, O, U), 
as well as the concept of parameter (an unspecified 
constant quantity), denoted by a capital consonant 
(B, C D, and so on). In his system the equation 5BA — 
2CA + A =D would appear as B5 in A quad — C plano 
2in A + A cub aequatur D solido. 

Viéte retained the classical principle of homogene- 
ity, according to which terms added together must all be 
of the same dimension. In the above equation, for exam- 
ple, each of the terms has the dimension of a solid or 
cube; thus, the constant C, which denotes a plane, is com- 
bined with A to form a quantity having the dimension of 
a solid. 

It should be noted that in Viete’s scheme the sym- 
bol A is part of the expression for the object obtained 
by operating on the magnitude denoted by A. Thus, 
operations on the quantities denoted by the variables 
are reflected in the algebraic notation itself. This inno- 
vation, considered by historians of mathematics to be 
a major conceptual advance in algebra, facilitated the 
study of the symbolic solution of algebraic equations 
and led to the creation of the first conscious theory 
of equations. 

After Viéte’s death the analytic art was applied to 
the study of curves by his countrymen Fermat and Des- 
cartes. Both men were motivated by the same goal, to 
apply the new algebraic techniques to Apollonius’s the- 
ory of loci as preserved in Pappus’s Collection. The most 


EUROPEAN MATHEMATICS SINCE THE MIDDLE AGES 
a 


celebrated of these problems consisted of finding the 
curve or locus traced by a point whose distances from 
several fixed lines satisfied a given relation. 

Fermat adopted Viete’s notation in his paper “Ad 
Locos Planos et Solidos Isagoge” (1636; “Introduction 
to Plane and Solid Loci’). The title of the paper refers 
to the ancient classification of curves as plane (straight 
lines, circles), solid (ellipses, parabolas, and hyperbolas), 
or linear (curves defined kinematically or by a locus con- 
dition). Fermat considered an equation among two vari- 
ables. One of the variables represented a line measured 
horizontally from a given initial point, while the other 
represented a second line positioned at the end of the 
first line and inclined at a fixed angle to the horizontal. 
As the first variable varied in magnitude, the second took 
on a value determined by the equation, and the endpoint 
of the second line traced out a curve in space. By means 
of this construction Fermat was able to formulate the 
fundamental principle of analytic geometry: 


Whenever two unknown quantities are found in final 
equality, there results a locus fixed in place, and the 
endpoint of one of these unknown quantities describes 
a straight line or a curve. 


The principle implied a correspondence between two dif- 
ferent classes of mathematical objects: geometric curves 
and algebraic equations. In the paper of 1636 Fermat 
showed that, if the equation is a quadratic, then the 
curve is a conic section—that is, an ellipse, parabola, or 
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hyperbola. He also showed that the determination of the 
curve given by an equation is simplified by a transfor- 
mation involving a change of variables to an equation in 
standard form. 

Descartes’s La Géométrie appeared in 1637 as an 
appendix to his famous Discourse on Method, the treatise 
that presented the foundation of his philosophical sys- 
tem. Although supposedly an example from mathemat- 
ics of his rational method, La Géométrie was a technical 
treatise understandable independently of philosophy. 
It was destined to become one of the most influential 
books in the history of mathematics. 

In the opening sections of La Géométrie, Descartes 
introduced two innovations. In place of Viéete’s notation 
he initiated the modern practice of denoting variables 
by letters at the end of the alphabet (x, y, z) and param- 
eters by letters at the beginning of the alphabet (a, 5, c) 
and of using exponential notation to indicate powers 
of x (x, x,. ..). More significant conceptually, he set aside 
Viéte’s principle of homogeneity, showing by means of 
a simple construction how to represent multiplication 
and division of lines by lines; thus, all magnitudes (lines, 
areas, and volumes) could be represented independently 
of their dimension in the same way. 

Descartes’s goal in La Géomeétrie was to achieve 
the construction of solutions to geometric problems by 
means of instruments that were acceptable generaliza- 
tions of ruler and compass. Algebra was a tool to be 
used in this program: 
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If, then, we wish to solve any problem, we first suppose 
the solution already effected, and give names to all 
the lines that seem necessary for its construction— 
to those that are unknown as well as to those that 
are known. Then, making no distinction in any way 
between known and unknown lines, we must unravel 
the difficulty in any way that shows most naturally 
the relations between these lines, until we find it possi- 
ble to express a single quantity in two ways. This will 
constitute an equation, since the terms of one of these 
two expressions are together equal to the terms of 
the other. 

In the problem of Apollonius, for example, one 
sought to find the locus of points whose distances from 
a collection of fixed lines satisfied a given relation. One 
used this relation to derive an equation, and then, using 
a geometric procedure involving acceptable instruments 
of construction, one obtained points on the curve given 
by the roots of the equation. 

Descartes described instruments more general than 
the compass for drawing “geometric” curves. He stipu- 
lated that the parts of the instrument be linked together 
so that the ratio of the motions of the parts could be 
knowable. This restriction excluded “mechanical” curves 
generated by kinematic processes. The Archimedean spi- 
ral, for example, was generated by a point moving on 
a line as the line rotated uniformly about the origin. 
The ratio of the circumference to the diameter did not 
permit exact determination: 
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the ratios between straight and curved lines are not 
known, and I even believe cannot be discovered by men, 
and therefore no conclusion based upon such ratios can 
be accepted as rigorous and exact. 


Descartes concluded that a geometric or nonme- 
chanical curve was one whose equation f(x, y) = 0 was a 
polynomial of finite degree in two variables. He wished to 
restrict mathematics to the consideration of such curves. 

Descartes’s emphasis on construction reflected his 
classical orientation. His conservatism with respect to 
what curves were acceptable in mathematics further 
distinguished him as a traditional thinker. At the time 
of his death, in 1650, he had been overtaken by events, 
as research moved away from questions of construc- 
tion to problems of finding areas (then called problems 
of quadrature) and tangents. The geometric objects 
that were then of growing interest were precisely the 
mechanical curves that Descartes had wished to banish 
from mathematics. 

Following the important results achieved in the 
16th century by Gerolamo Cardano and the Italian 
algebraists, the theory of algebraic equations reached 
an impasse. The ideas needed to investigate equations 
of degree higher than four were slow to develop. The 
immediate historical influence of Viéte, Fermat, and 
Descartes was to furnish algebraic methods for the 
investigation of curves. A vigorous school of research 
became established in Leiden around Frans van Schoo- 
ten, a Dutch mathematician who edited and published 
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in 1649 a Latin translation of La Géométrie. Van Schoo- 
ten published a second two-volume translation of the 
same work in 1659-1661 that also contained math- 
ematical appendixes by three of his disciples, Johan 
de Witt, Johan Hudde, and Hendrick van Heuraet. 
The Leiden group of mathematicians, which also 
included Christiaan Huygens, was in large part respon- 
sible for the rapid development of Cartesian geometry 
in the middle of the century. 


THE CALCULUS 


The historian Carl Boyer called the calculus “the most 
effective instrument for scientific investigation that 
mathematics has ever produced.” As the mathematics 
of variability and change, the calculus was the char- 
acteristic product of the scientific revolution. The 
subject was properly the invention of two mathemati- 
cians, the German Gottfried Wilhelm Leibniz and the 
Englishman Isaac Newton. Both men published their 
researches in the 1680s, Leibniz in 1684 in the recently 
founded journal Acta Eruditorum and Newton in 1687 
in his great treatise, the Principia. Although a bitter dis- 
pute over priority developed later between followers of 
the two men, it is now clear that they each arrived at the 
calculus independently. 

The calculus developed from techniques to solve two 
types of problems, the determination of areas and vol- 
umes and the calculation of tangents to curves. In clas- 
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sical geometry Archimedes had advanced farthest in this 
part of mathematics, having used the method of exhaus- 
tion to establish rigorously various results on areas and 
volumes and having derived for some curves (e.g., the spi- 
ral) significant results concerning tangents. In the early 
17th century there was a sharp revival of interest in both 
classes of problems. The decades between 1610 and 1670, 
referred to in the history of mathematics as “the precal- 
culus period,” were a time of remarkable activity in which 
researchers throughout Europe contributed novel solu- 
tions and competed with each other to arrive at important 
new methods. 


THE PRECALCULUS PERIOD 


In his treatise Geometria Indivisibilibus Continuo- 
rum (1635; “Geometry of Continuous Indivisi- 
bles”), Bonaventura Cavalieri, a professor of math- 
ematics at the University of Bologna, formulated 
a systematic method for the determination of areas 
and volumes. As had Archimedes, Cavalieri regarded 
a plane figure as being composed of a collection of 
indivisible lines, “all the lines” of the plane figure. 
The collection was generated by a fixed line moving 
through space parallel to itself. Cavalieri showed that 
these collections could be interpreted as magnitudes 
obeying the rules of Euclidean ratio theory. In propo- 
sition 4 of Book IJ, he derived the result that is written 
today as 
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Let there be given a parallelogram in which a diagonal 
is drawn; then “all the squares” of the parallelogram 
will be triple “all the squares” of each of the triangles 
determined by the diagonal. 


Cavalieri showed that this proposition could be inter- 
preted in different ways—as asserting, for example, that 
the volume of a cone is one-third the volume of the cir- 
cumscribed cylinder or that the area under a segment of a 
parabola is one-third the area of the associated rectangle. 
In a later treatise he generalized the result by proving 


Bonaventura Cavalieri observed that figures 
(solids) of equal height and in which all 
corresponding cross sections match in length 
(area) are of equal area (volume). 
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forn=3ton=9. To establish these results, he introduced 
transformations among the variables of the problem, 
using a result equivalent to the binomial theorem for 
integral exponents. The ideas involved went beyond any- 
thing that had appeared in the classical Archimedean 
theory of content. 

Although Cavalieri was successful in formulating a 
systematic method based on general concepts, his ideas 
were not easy to apply. The derivation of very simple 
results required intricate geometric considerations, and 
the turgid style of the Geometria Indivisibilibus was a 
barrier to its reception. 

John Wallis presented a quite different approach 
to the theory of quadratures in his Arithmetica Infini- 
torum (1655; The Arithmetic of Infinitesimals). Wallis, a 
successor to Henry Briggs as the Savilian Professor of 
Geometry at Oxford, was a champion of the new meth- 
ods of arithmetic algebra that he had learned from his 
teacher William Oughtred. Wallis expressed the area 
under a curve as the sum of an infinite series and used 
clever and unrigorous inductions to determine its value. 
To calculate the area under the parabola, 


1 
| xix, 
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he considered the successive sums 
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and inferred by “induction” the general relation 
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By letting the number of terms be infinite, he 
obtained 1/3 as the limiting value of the expression. 
With more complicated curves he achieved very impres- 
sive results, including the infinite expression now known 
as Wallis’s product: 


Research on the determination of tangents, the other 
subject leading to the calculus, proceeded along different 
lines. In La Géométrie Descartes had presented a method 
that could in principle be applied to any algebraic or 
“geometric” curve—i.e., any curve whose equation was a 
polynomial of finite degree in two variables. The method 
depended upon finding the normal, the line perpendicu- 
lar to the tangent, using the algebraic condition that it be 
the unique radius to intersect the curve in only one point. 
Descartes’s method was simplified by Hudde, a member 
of the Leiden group of mathematicians, and was pub- 
lished in 1659 in van Schooten’s edition of La Géométrie. 

A class of curves of growing interest in the 17th cen- 
tury comprised those generated kinematically by a point 
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A cycloid is produced by a point on the circumference of a 


circle as the circle rolls along a straight line. 


moving through space. The famous cycloidal curve, for 
example, was traced by a point on the perimeter of a wheel 
that rolled on a line without slipping or sliding. These 
curves were nonalgebraic and hence could not be treated 
by Descartes’s method. Gilles Personne de Roberval, pro- 
fessor at the Collége Royale in Paris, devised a method 
borrowed from dynamics to determine their tangents. In 
his analysis of projectile motion Galileo had shown that 
the instantaneous velocity of a particle is compounded 
of two separate motions: a constant horizontal motion 
and an increasing vertical motion due to gravity. If the 
motion of the generating point of a kinematic curve is 
likewise regarded as the sum of two velocities, then the 
tangent will lie in the direction of their sum. Roberval 
applied this idea to several different kinematic curves, 
obtaining results that were often ingenious and elegant. 
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In an essay of 1636 circulated among French math- 
ematicians, Fermat presented a method of tangents 
adapted from a procedure he had devised to determine 
maxima and minima and used it to find tangents to several 
algebraic curves of the form y = x’. His account was short 
and contained no explanation of the mathematical basis 
of the new method. It is possible to see in his procedure an 
argument involving infinitesimals, and Fermat has some- 
times been proclaimed the discoverer of the differential 
calculus. Modern historical study, however, suggests that 
he was working with concepts introduced by Viéte and 
that his method was based on finite algebraic ideas. 


——————— 
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Fermat's tangent method 
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Isaac Barrow, the Lucasian Professor of Mathematics 
at the University of Cambridge, published in 1670 his Geo- 
metrical Lectures, a treatise that more than any other antic- 
ipated the unifying ideas of the calculus. In it he adopted 
a purely geometric form of exposition to show how the 
determinations of areas and tangents are inverse problems. 
He began with a curve and considered the slope of its tan- 
gent corresponding to each value of the abscissa. He then 
defined an auxiliary curve by the condition that its ordinate 
be equal to this slope and showed that the area under the 
auxiliary curve corresponding to a given abscissa is equal to 
the rectangle whose sides are unity and the ordinate of the 
original curve. When reformulated analytically, this result 
expresses the inverse character of differentiation and inte- 
gration, the fundamental theorem of the calculus. Although 
Barrow’s decision to proceed geometrically prevented him 
from taking the final step to a true calculus, his lectures 
influenced both Newton and Leibniz. 


NEWTON AND LEIBNIZ 


The essential insight of Newton and Leibniz was to use 
Cartesian algebra to synthesize the earlier results and to 
develop algorithms that could be applied uniformly to a 
wide class of problems. The formative period of New- 
ton’s researches was from 1665 to 1670, while Leibniz 
worked a few years later, in the 1670s. Their contribu- 
tions differ in origin, development, and influence, and it 
is necessary to consider each man separately. 
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The fundamental theorem of calculus 


Newton, the son of an English farmer, became in 
1669 the Lucasian Professor of Mathematics at the Uni- 
versity of Cambridge. Newton’s earliest researches in 
mathematics grew in 1665 from his study of van Schoo- 
ten’s edition of La Géomeétrie and Wallis’s Arithmetica 
Infinitorum. Using the Cartesian equation of the curve, 
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he reformulated Wallis’s results, introducing for this 
purpose infinite sums in the powers of an unknown x, 
now known as infinite series. Possibly under the influ- 
ence of Barrow, he used infinitesimals to establish for 
various curves the inverse relationship of tangents and 
areas. The operations of differentiation and integration 
emerged in his work as analytic processes that could be 
applied generally to investigate curves. 

Unusually sensitive to questions of rigour, Newton 
at a fairly early stage tried to establish his new method on 
a sound foundation using ideas from kinematics. A vari- 
able was regarded as a “fluent,” a magnitude that flows 
with time; its derivative or rate of change with respect to 
time was called a “fluxion,” denoted by the given variable 
with a dot above it. The basic problem of the calculus was 
to investigate relations among fluents and their fluxions. 
Newton finished a treatise on the method of fluxions as 
early as 1671, although it was not published until 1736. 
In the 18th century this method became the preferred 
approach to the calculus among British mathematicians, 
especially after the appearance in 1742 of Colin Maclau- 
rin’s influential Treatise of Fluxions. 

Newton first published the calculus in Book I of 
his great Philosophiae Naturalis Principia Mathemat- 
ica (1687; Mathematical Principles of Natural Philoso- 
phy). Originating as a treatise on the dynamics of par- 
ticles, the Principia presented an inertial physics that 
combined Galileo’s mechanics and Kepler’s planetary 
astronomy. It was written in the early 1680s at a time 
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when Newton was reacting against Descartes’s science 
and mathematics. Setting aside the analytic method of 
fluxions, Newton introduced in 11 introductory lemmas 
his calculus of first and last ratios, a geometric the- 
ory of limits that provided the mathematical basis of 
his dynamics. 

Newton’s use of the calculus in the Principia is illus- 
trated by proposition 11 of Book I: if the orbit of a parti- 
cle moving under a centripetal force is an ellipse with the 
centre of force at one 
focus, then the force 
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from the force centre to the particle) and the tangent to 
the curve at a point. This result expressed geometrically 
the proportionality of force to vector acceleration. Using 
properties of the ellipse known from classical geometry, 
Newton calculated the limit of this measure and showed 
that it was equal to a constant times | over the square of 
the radius. 

Newton avoided analytic processes in the Prin- 
cipia by expressing magnitudes and ratios directly in 
terms of geometric quantities, both finite and infin- 
itesimal. His decision to eschew analysis constituted a 
striking rejection of the algebraic methods that had been 
important in his own early researches on the calculus. 
Although the Principia was of inestimable value for later 
mechanics, it would be reworked by researchers on the 
Continent and expressed in the mathematical idiom of 
the Leibnizian calculus. 

Leibniz’s interest in mathematics was aroused in 
1672 during a visit to Paris, where the Dutch mathemati- 
cian Christiaan Huygens introduced him to his work on 
the theory of curves. Under Huygens’s tutelage Leibniz 
immersed himself for the next several years in the study 
of mathematics. He investigated relationships between 
the summing and differencing of finite and infinite 
sequences of numbers. Having read Barrow’s geometric 
lectures, he devised a transformation rule to calculate 
quadratures, obtaining the famous infinite series for 1/4: 
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Leibniz was interested in questions of logic and nota- 
tion, of how to construct a characteristica universalis for 
rational investigation. After considerable experimenta- 
tion he arrived by the late 1670s at an algorithm based 
on the symbols d and J. He first published his research 
on differential calculus in 1684 in an article in the Acta 
Eruditorum, “Nova Methodus pro Maximis et Minimis, 
Itemque Tangentibus, qua nec Fractas nec Irrationales 
Quantitates Moratur, et Singulare pro illi Calculi Genus” 
(“A New Method for Maxima and Minima as Well as 
Tangents, Which Is Impeded Neither by Fractional nor 
by Irrational Quantities, and a Remarkable Type of Cal- 
culus for This’). In this article he introduced the differ- 
ential dx satisfying the rules d(x + y) = dx + dy and d(xy) 
= xdy + ydx and illustrated his calculus with a few exam- 
ples. Two years later he published a second article, “On 
a Deeply Hidden Geometry,” in which he introduced 
and explained the symbol J for integration. He stressed 
the power of his calculus to investigate transcendental 
curves, the very class of “mechanical” objects Descartes 
had believed lay beyond the power of analysis, and 
derived a simple analytic formula for the cycloid. 

Leibniz continued to publish results on the new cal- 
culus in the Acta Eruditorum and began to explore his 
ideas in extensive correspondence with other scholars. 
Within a few years he had attracted a group of research- 
ers to promulgate his methods, including the broth- 
ers Johann Bernoulli and Jakob Bernoulli in Basel and 
the priest Pierre Varignon and Guillaume-Fran¢ois-An- 
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toine de L’Hospital in Paris. In 1700 he persuaded Fred- 
erick William I of Prussia to establish the Brandenburg 
Society of Sciences (later renamed the Berlin Academy 
of Sciences), with himself appointed president for life. 
Leibniz’s vigorous espousal of the new calculus, the 
didactic spirit of his writings, and his ability to attract 
a community of researchers contributed to his enor- 
mous influence on subsequent mathematics. In contrast, 
Newton’s slowness to publish and his personal reticence 
resulted in a reduced presence within European mathe- 
matics. Although the British school in the 18th century 
included capable researchers, Abraham de Moivre, James 
Stirling, Brook Taylor, and Maclaurin among them, they 
failed to establish a program of research comparable to 
that established by Leibniz’s followers on the Continent. 
There is a certain tragedy in Newton’s isolation and his 
reluctance to acknowledge the superiority of continental 
analysis. As the historian Michael Mahoney observed: 


Whatever the revolutionary influence of the Principia, 
mathematics would have looked much the same if New- 
ton had never existed. In that endeavour he belonged to 
a community, and he was far from indispensable to it. 


INSTITUTIONAL BACKGROUND 


After 1700 a movement to found learned societies on the 
model of Paris and London spread throughout Europe 
and the American colonies. The academy was the pre- 
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dominant institution of science until it was displaced by 
the university in the 19th century. The leading mathema- 
ticians of the period, such as Leonhard Euler, Jean Le 
Rond d’Alembert, and Joseph-Louis Lagrange, pursued 
academic careers at St. Petersburg, Paris, and London. 
The French Academy of Sciences (Paris) provides 
an informative study of the 18th-century learned society. 
The academy was divided into six sections, three for the 
mathematical and three for the physical sciences. The 
mathematical sections were for geometry, astronomy, and 
mechanics, the physical sections for chemistry, anatomy, 
and botany. Membership in the academy was divided 
by section, with each section contributing three pension- 
naires, two associates, and two adjuncts. There was also 
a group of free associates, distinguished men of science 
from the provinces, and foreign associates, eminent 
international figures in the field. A larger group of 70 
corresponding members had partial privileges, including 
the right to communicate reports to the academy. The 
administrative core consisted of a permanent secretary, 
treasurer, president, and vice president. In a given year 
the average total membership in the academy was 153. 
Prominent characteristics of the academy included 
its small and elite membership, made up heavily of men 
from the middle class, and its emphasis on the mathe- 
matical sciences. In addition to holding regular meetings 
and publishing memoirs, the academy organized scien- 
tific expeditions and administered prize competitions on 
important mathematical and scientific questions. 
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The historian Roger Hahn noted that the academy 
in the 18th century allowed “the coupling of relative doc- 
trinal freedom on scientific questions with rigorous eval- 
uations by peers,” an important characteristic of modern 
professional science. Academic mathematics and science 
did, however, foster a stronger individualistic ethos than 
is usual today. A determined individual such as Euler or 
Lagrange could emphasize a given program of research 
through his own work, the publications of the academy, 
and the setting of the prize competitions. The academy 
as an institution may have been more conducive to the 
solitary patterns of research in a theoretical subject like 
mathematics than it was to the experimental sciences. 
The separation of research from teaching is perhaps the 
most striking characteristic that distinguished the acad- 
emy from the model of university-based science that 
developed in the 19th century. 


ANALYsIsS AND MECHANICS 


The scientific revolution had bequeathed to mathematics 
a major program of research in analysis and mechanics. 
The period from 1700 to 1800, “the century of analy- 
sis,’ witnessed the consolidation of the calculus and 
its extensive application to mechanics. With expansion 
came specialization as different parts of the subject 
acquired their own identity: ordinary and partial differ- 
ential equations, calculus of variations, infinite series, 


and differential geometry. The applications of analysis 
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were also varied, including the theory of the vibrating 
string, particle dynamics, the theory of rigid bodies, the 
mechanics of flexible and elastic media, and the theory 
of compressible and incompressible fluids. Analysis and 
mechanics developed in close association, with problems 
in One giving rise to concepts and techniques in the other, 
and all the leading mathematicians of the period made 
important contributions to mechanics. 

The close relationship between mathematics and 
mechanics in the 18th century had roots extending 
deep into Enlightenment thought. In the organizational 
chart of knowledge at the beginning of the preliminary 
discourse to the Encyclopédie, Jean Le Rond d’Alem- 
bert distinguished between “pure” mathematics (geom- 
etry, arithmetic, algebra, calculus) and “mixed” mathe- 
matics (mechanics, geometric astronomy, optics, art of 
conjecturing). Mathematics generally was classified as 
a “science of nature” and separated from logic, a “sci- 
ence of man.” The modern disciplinary division between 
physics and mathematics and the association of the lat- 
ter with logic had not yet developed. 

Mathematical mechanics itself as it was practiced 
in the 18th century differed in important respects from 
later physics. The goal of modern physics is to explore 
the ultimate particulate structure of matter and to arrive 
at fundamental laws of nature to explain physical phe- 
nomena. The character of applied investigation in the 
18th century was rather different. The material parts of 
a given system and their interrelationship were idealized 
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for the purposes of analysis. A material object could be 
treated as a point-mass (a mathematical point at which 
it is assumed all the mass of the object is concentrated), 
as a rigid body, as a continuously deformable medium, 
and so on. The intent was to obtain a mathematical 
description of the macroscopic behaviour of the system 
rather than to ascertain the ultimate physical basis of the 
phenomena. In this respect the 18th-century viewpoint 
is closer to modern mathematical engineering than it is 
to physics. 

Mathematical research in the 18th century was 
coordinated by the Paris, Berlin, and St. Petersburg 
academies, as well as by several smaller provincial sci- 
entific academies and societies. Although England and 
Scotland were important centres early in the century, 
with Maclaurin’s death in 1746 the British flame was all 
but extinguished. 


History oF ANALYSIS 


The history of analysis in the 18th century can be fol- 
lowed in the official memoirs of the academies and in 
independently published expository treatises. In the first 
decades of the century the calculus was cultivated in an 
atmosphere of intellectual excitement as mathematicians 
applied the new methods to a range of problems in the 
geometry of curves. The brothers Johann and Jakob 
Bernoulli showed that the shape of a smooth wire along 
which a particle descends in the least time 1s the cycloid, 
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a transcendental curve much studied in the previous cen- 
tury. Working in a spirit of keen rivalry, the two brothers 
arrived at ideas that would later develop into the calcu- 
lus of variations. In his study of the rectification of the 
lemniscate, a ribbon-shaped curve discovered by Jakob 
Bernoulli in 1694, Giulio Carlo Fagnano (1682-1766) 
introduced ingenious analytic transformations that laid 
the foundation for the theory of elliptic integrals. Niko- 
laus I Bernoulli (1687-1759), the nephew of Johann 
and Jakob, proved the equality of mixed second-order 
partial derivatives and made important contributions 
to differential equations by the construction of orthog- 
onal trajectories to families of curves. Pierre Varignon 
(1654-1722), Johann Bernoulli, and Jakob Hermann 
(1678-1733) continued to develop analytic dynamics as 
they adapted Leibniz’s calculus to the inertial mechanics 
of Newton’s Principia. 

Geometric conceptions and problems predominated 
in the early calculus. This emphasis on the curve as the 
object of study provided coherence to what was otherwise 
a disparate collection of analytic techniques. With its 
continued development, the calculus gradually became 
removed from its origins in the geometry of curves, and 
a movement emerged to establish the subject on a purely 
analytic basis. In a series of textbooks published in the 
middle of the century, the Swiss mathematician Leon- 
hard Euler systematically accomplished the separation 
of the calculus from geometry. In his Jntroductio in Ana- 
lysin Infinitorum (1748; Introduction to the Analysis of 
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the Infinite), he made the notion of function the central 
organizing concept of analysis: 


A function of a variable quantity is an analytical 
expression composed in any way from the variable and 
from numbers or constant quantities. 


Euler’s analytic approach is illustrated by his intro- 
duction of the sine and cosine functions. Trigonome- 
try tables had existed since antiquity, and the relations 
between sines and cosines were commonly used in math- 
ematical astronomy. In the early calculus mathemati- 
cians had derived in their study of periodic mechanical 
phenomena the differential equation 


dy l 
d& Jl 


and they were able to interpret its solution geometrically 
in terms of lines and angles in the circle. Euler was the 
first to introduce the sine and cosine functions as quan- 
tities whose relation to other quantities could be studied 
independently of any geometric diagram. 

Euler’s analytic approach to the calculus received 
support from his younger contemporary Joseph-Louis 
Lagrange, who, following Euler’s death in 1783, replaced 
him as the leader of European mathematics. In 1755 the 
19-year-old Lagrange wrote to Euler to announce the dis- 
covery of a new algorithm in the calculus of variations, a 
subject to which Euler had devoted an important treatise 
11 years earlier. Euler had used geometric ideas exten- 
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sively and had emphasized the need for analytic meth- 
ods. Lagrange’s idea was to introduce the new symbol 6 
into the calculus and to experiment formally until he had 
devised an algorithm to obtain the variational equations. 
Mathematically quite distinct from Euler’s procedure, 
his method required no reference to the geometric con- 
figuration. Euler immediately adopted Lagrange’s idea, 
and in the next several years the two men systematically 
revised the subject using the new techniques. 

In 1766 Lagrange was invited by the Prussian 
king, Frederick the Great, to become mathematics 
director of the Berlin Academy. During the next two 
decades he wrote important memoirs on nearly all of the 
major areas of mathematics. In 1788 he published his 
famous Mécanique analytique, a treatise that used varia- 
tional ideas to present mechanics from a unified analytic 
viewpoint. In the preface Lagrange wrote: 


One will find no Figures in this Work. The methods that 
I present require neither constructions nor geometrical 
or mechanical reasonings, but only algebraic opera- 
tions, subject to a regular and uniform course. Those 
who admire Analysis, will with pleasure see Mechanics 
become a new branch of it, and will be grateful to me 
for having extended its domain. 

Following the death of Frederick the Great, 
Lagrange traveled to Paris to become a pensionnaire of 
the Academy of Sciences. With the establishment of the 
Ecole Polytechnique (French: “Polytechnic School”) in 
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1794, he was asked to deliver the lectures on mathe- 
matics. There was a concern in European mathematics 
at the time to place the calculus on a sound basis, and 
Lagrange used the occasion to develop his ideas for 
an algebraic foundation of the subject. The lectures were 
published in 1797 under the title Théorie des fonctions 
analytiques (“Theory of Analytical Functions’), a trea- 
tise whose contents were summarized in its longer title, 
“Containing the Principles of the Differential Calculus 
Disengaged from All Consideration of Infinitesimals, 
Vanishing Limits, or Fluxions and Reduced to the 
Algebraic Analysis of Finite Quantities.” Lagrange 
published a second treatise on the subject in 1801, a 
work that appeared in a revised and expanded form 
in 1806. 

The range of subjects presented and the con- 
sistency of style distinguished Lagrange’s didactic 
writings from other contemporary expositions of 
the calculus. He began with Euler’s notion of a func- 
tion as an analytic expression composed of variables 
and constants. He defined the “derived function,” 
or derivative f(x) of f(x), to be the coefficient of 7 in 
the Taylor expansion of f(x + i). Assuming the general 
possibility of such expansions, he attempted a rather 
complete theory of the differential and integral calcu- 
lus, including extensive applications to geometry and 
mechanics. Lagrange’s lectures represented the most 
advanced development of the 18th-century analytic 
conception of the calculus. 
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Beginning with Baron Cauchy in the 1820s, later 
mathematicians used the concept of limit to establish the 
calculus on an arithmetic basis. The algebraic viewpoint 
of Euler and Lagrange was rejected. To arrive at a proper 
historical appreciation of their work, it 1s necessary to 
reflect on the meaning of analysis in the 18th century. 
Since Viéete, analysis had referred generally to mathe- 
matical methods that employed equations, variables, and 
constants. With the extensive development of the calculus 
by Leibniz and his school, analysis became identified with 
all calculus-related subjects. In addition to this historical 
association, there was a deeper sense in which analytic 
methods were fundamental to the new mathematics. An 
analytic equation implied the existence of a relation that 
remained valid as the variables changed continuously in 
magnitude. Analytic algorithms and transformations 
presupposed a correspondence between local and global 
change, the basic concern of the calculus. It is this aspect 
of analysis that fascinated Euler and Lagrange and caused 
them to see in it the “true metaphysics” of the calculus. 


OTHER DEVELOPMENTS 


During the period 1600-1800 significant advances occurred 
in the theory of equations, foundations of Euclidean geom- 
etry, number theory, projective geometry, and probability 
theory. These subjects, which became mature branches of 
mathematics only in the 19th century, never rivaled analy- 
sis and mechanics as programs of research. 


THE History OF MATHEMATICS 


THEORY OF EQUATIONS 


After the dramatic successes of Niccolo Fontana Tarta- 
glia and Lodovico Ferrari in the 16th century, the the- 
ory of equations developed slowly, as problems resisted 
solution by known techniques. In the later 18th century 
the subject experienced an infusion of new ideas. Inter- 
est was concentrated on two problems. The first was to 
establish the existence of a root of the general polyno- 
mial equation of degree n. The second was to express the 
roots as algebraic functions of the coefficients or to show 
why it was not, in general, possible to do so. 

The proposition that the general polynomial with 
real coefficients has a root of the form a + b\—1 became 
known later as the fundamental theorem of algebra. 
By 1742 Euler had recognized that roots appear in con- 
jugate pairs; if a + by—1 is a root, then so is a — by-1. 
Thus, if a + by-1 is a root of f(x) = 0, then f(x) = 
(x —2ax-a - b )g(x). The fundamental theorem was 
therefore equivalent to asserting that a polynomial may 
be decomposed into linear and quadratic factors. This 
result was of considerable importance for the theory of 
integration, since by the method of partial fractions it 
ensured that a rational function, the quotient of two 
polynomials, could always be integrated in terms of alge- 
braic and elementary transcendental functions. 

Although d’Alembert, Euler, and Lagrange worked 
on the fundamental theorem, the first successful proof 
was developed by Carl Friedrich Gauss in his doctoral 
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dissertation of 1799. Earlier researchers had investigated 
special cases or had concentrated on showing that all pos- 
sible roots were of the form a + by—1. Gauss tackled the 
problem of existence directly. Expressing the unknown 
in terms of the polar coordinate variables r and 0, he 
showed that a solution of the polynomial would lie at 
the intersection of two curves of the form 7(r, 0) = 0 
and U(r, 8) = 0. By a careful and rigorous investigation 
he proved that the two curves intersect. 

Gauss’s demonstration of the fundamental theorem 
initiated a new approach to the question of mathemat- 
ical existence. In the 18th century mathematicians were 
interested in the nature of particular analytic processes 
or the form that given solutions should take. Mathemati- 
cal entities were regarded as things that were given, not as 
things whose existence needed to be established. Because 
analysis was applied in geometry and mechanics, the 
formalism seemed to possess a clear interpretation that 
obviated any need to consider questions of existence. 
Gauss’s demonstration was the beginning of a change of 
attitude in mathematics, of a shift to the rigorous, inter- 
nal development of the subject. 

The problem of expressing the roots of a polyno- 
mial as functions of the coefficients was addressed by 
several mathematicians independently about 1770. The 
Cambridge mathematician Edward Waring published 
treatises in 1762 and 1770 on the theory of equations. In 
1770 Lagrange presented a long expository memoir on 
the subject to the Berlin Academy, and in 1771 Alexandre 
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Vandermonde submitted a paper to the French Acad- 
emy of Sciences. Although the ideas of the three men 
were related, Lagrange’s memoir was the most extensive 
and most influential historically. 

Lagrange presented a detailed analysis of the solu- 
tion by radicals of second-, third-, and fourth-degree 
equations and investigated why these solutions failed 
when the degree was greater than or equal to five. He 
introduced the novel idea of considering functions of 
the roots and examining the values they assumed as 
the roots were permuted. He was able to show that the 
solution of an equation depends on the construction 
of a second resolvent equation, but he was unable to 
provide a general procedure for solving the resolvent 
when the degree of the original equation was greater 
than four. Although his theory left the subject in an 
unfinished condition, it provided a solid basis for 
future work. The search for a general solution to the 
polynomial equation would provide the greatest sin- 
gle impetus for the transformation of algebra in the 
19th century. 

The efforts of Lagrange, Vandermonde, and War- 
ing illustrate how difficult it was to develop new con- 
cepts in algebra. The history of the theory of equations 
belies the view that mathematics is subject to almost 
automatic technical development. Much of the later 
algebraic work would be devoted to devising termi- 
nology, concepts, and methods necessary to advance 
the subject. 
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FOUNDATIONS OF GEOMETRY 


Although the emphasis of mathematics after 1650 was 
increasingly on analysis, foundational questions in clas- 
sical geometry continued to arouse interest. Attention 
centred on the fifth postulate of Book I of the Elements, 
which Euclid had used to prove the existence of a unique 
parallel through a point to a given line. Since antiquity, 
Greek, Islamic, and European geometers had attempted 
unsuccessfully to show that the parallel postulate need 
not be a postulate but could instead be deduced from 
the other postulates of Euclidean geometry. During the 
period 1600-1800 mathematicians continued these efforts 
by trying to show that the postulate was equivalent to 
some result that was considered self-evident. Although 
the decisive breakthrough to non-Euclidean geome- 
try would not occur until the 19th century, researchers 
did achieve a deeper and more systematic understanding 
of the classical properties of space. 

Interest in the parallel postulate developed in the 16th 
century after the recovery and Latin translation of Proclus’s 
commentary on Euclid’s Elements. The Italian researchers 
Christopher Clavius in 1574 and Giordano Vitale in 1680 
showed that the postulate is equivalent to asserting that 
the line equidistant from a straight line is a straight line. 
In 1693 John Wallis, Savilian Professor of Geometry at 
Oxford, attempted a different demonstration, proving that 
the axiom follows from the assumption that to every figure 
there exists a similar figure of arbitrary magnitude. 
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In 1733 the Italian Girolamo Saccheri published 
his Euclides ab Omni Naevo Vindicatus (“Euclid Cleared 
of Every Flaw”). This was an important work of synthe- 
sis in which he provided a complete analysis of the prob- 
lem of parallels in terms of Omar Khayyam’s quadrilat- 
eral. Using the Euclidean assumption that straight lines 
do not enclose an area, he was able to exclude geometries 
that contain no parallels. It remained to prove the exis- 
tence of a unique parallel through a point to a given line. 
To do this, Saccheri adopted the procedure of reductio 
ad absurdum; he assumed the existence of more than one 
parallel and attempted to derive a contradiction. After a 
long and detailed investigation, he was able to convince 
himself (mistakenly) that he had found the desired con- 
tradiction. 

In 1766 Johann Heinrich Lambert of the Berlin 
Academy composed Die Theorie der Parallellinien (“The 
Theory of Parallel Lines”; published 1786), a penetrat- 
ing study of the fifth postulate in Euclidean geometry. 
Among other theorems Lambert proved is that the par- 
allel axiom is equivalent to the assertion that the sum 
of the angles of a triangle is equal to two right angles. 
He combined this fact with Wallis’s result to arrive at an 
unexpected characterization of classical space. Accord- 
ing to Lambert, if the parallel postulate is rejected, it 
follows that for every angle 6 less than 2.R/3 (R is a right 
angle) an equilateral triangle can be constructed with 
corner angle 0. By Wallis’s result any triangle similar to 
this triangle must be congruent to it. It is therefore pos- 
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sible to associate with every angle a definite length, the 
side of the corresponding equilateral triangle. Since the 
measurement of angles is absolute, independent of any 
convention concerning the selection of units, it follows 
that an absolute unit of length exists. Hence, to accept 
the parallel postulate is to deny the possibility of an 
absolute concept of length. 

The final 18th-century contribution to the the- 
ory of parallels was Adrien-Marie Legendre’s text- 
book Eléments de géométrie (Elements of Geometry and 
Trigonometry), the first edition of which appeared in 
1794. Legendre presented an elegant demonstration 
that purported to show that the sum of the angles of 
a triangle is equal to two right angles. He believed that 
he had conclusively established the validity of the par- 
allel postulate. His work attracted a large audience and 
was influential in informing readers of the new ideas 
in geometry. 

The 18th-century failure to develop a non-Euclid- 
ean geometry was rooted in deeply held philosophical 
beliefs. In his Critique of Pure Reason (1781), Immanuel 
Kant had emphasized the synthetic a priori charac- 
ter of mathematical judgments. From this standpoint, 
statements of geometry and arithmetic were necessar- 
ily true propositions with definite empirical content. 
The existence of similar figures of different size, or the 
conventional character of units of length, appeared 
self-evident to mathematicians of the period. As late as 
1824 Pierre-Simon, marquis de Laplace, wrote: 
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Thus the notion of space includes a special property, 
self-evident, without which the properties of parallels 
cannot be rigorously established. The idea of a bounded 
region, e.g., the circle, contains nothing which depends 
on its absolute magnitude. But if we imagine its radius 
to diminish, we are brought without fail to the diminu- 
tion in the same ratio of its circumference and the sides 
of all the inscribed figures. This proportionality appears 
to me amore natural postulate than that of Euclid, and 
it is worthy of note that it is discovered afresh in the 
results of the theory of universal gravitation. 


MATHEMATICS IN THE 19TH CENTURY 


Most of the powerful abstract mathematical theories 
in use today originated in the 19th century, so any his- 
torical account of the period should be supplemented 
by reference to detailed treatments of these topics. Yet 
mathematics grew so much during this period that any 
account must necessarily be selective. Nonetheless, some 
broad features stand out. The growth of mathematics as 
a profession was accompanied by a sharpening division 
between mathematics and the physical sciences, and con- 
tact between the two subjects takes place today across a 
clear professional boundary. One result of this separa- 
tion has been that mathematics, no longer able to rely on 
its scientific import for its validity, developed markedly 
higher standards of rigour. It was also freed to develop 
in directions that had little to do with applicability. Some 
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of these pure creations have turned out to be surpris- 
ingly applicable, while the attention to rigour has led to 
a wholly novel conception of the nature of mathemat- 
ics and logic. Moreover, many outstanding questions in 
mathematics yielded to the more conceptual approaches 
that came into vogue. 


PROJECTIVE GEOMETRY 


The French Revolution provoked a radical rethinking 
of education in France, and mathematics was given a 
prominent role. The Ecole Polytechnique was established 
in 1794 with the ambitious task of preparing all can- 
didates for the specialist civil and military engineering 
schools of the republic. Mathematicians of the highest 
calibre were involved; the result was a rapid and sus- 
tained development of the subject. The inspiration for 
the Ecole was that of Gaspard Monge, who believed 
strongly that mathematics should serve the scientific and 
technical needs of the state. To that end he devised a syl- 
labus that promoted his own descriptive geometry, which 
was useful in the design of forts, gun emplacements, and 
machines and which was employed to great effect in the 
Napoleonic survey of Egyptian historical sites. 

In Monge’s descriptive geometry, three-dimensional 
objects are described by their orthogonal projections onto 
a horizontal and a vertical plane, the plan and elevation of 
the object. A pupil of Monge, Jean-Victor Poncelet, was 
taken prisoner during Napoleon’s retreat from Moscow 
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and sought to keep up his spirits while in jail in Saratov by 
thinking over the geometry he had learned. He dispensed 
with the restriction to orthogonal projections and decided 
to investigate what properties figures have in common 
with their shadows. There are several of these properties: 
a Straight line casts a straight shadow, and a tangent to a 
curve casts a shadow that is tangent to the shadow of the 
curve. But some properties are lost: the lengths and angles 
of a figure bear no relation to the lengths and angles of its 
shadow. Poncelet felt that the properties that survive are 
worthy of study, and, by considering only those proper- 
ties that a figure shares 
with all its shadows, 
Poncelet hoped to put 
truly geometric rea- 
soning on a par with 
algebraic geometry. 

In 1822 Poncelet 
published the Traité 
des propriétés projec- 
tives des figures (“Trea- 
tise on the Projective 
Propertiesof Figures”). 
From his standpoint 
every conic section is 
equivalent to a circle, 
so his treatise con- 
tained a unified treat- 
ment of the theory 


Jean-Victor Poncelet 
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of conic sections. It also established several new results. 
Geometers who took up his work divided into two groups: 
those who accepted his terms and those who, finding them 
obscure, reformulated his ideas in the spirit of algebraic 
geometry. On the algebraic side it was taken up in Germany 
by August Ferdinand Mobius, who seems to have come to 
his ideas independently of Poncelet, and then by Julius 
Pliicker. They showed how rich was the projective geom- 
etry of curves defined by algebraic equations and thereby 
gave an enormous boost to the algebraic study of curves, 
comparable to the original impetus provided by Descartes. 
Germany also produced synthetic projective geometers, 
notably Jakob Steiner (born in Switzerland but educated 
in Germany) and Karl Georg Christian von Staudt, who 
emphasized what can be understood about a figure from a 
careful consideration of all its transformations. 

Within the debates about projective geometry 
emerged one of the few synthetic ideas to be discovered 
since the days of Euclid, that of duality. This associates 
with each point a line and with each line a point, in such 
a way that (1) three points lying in a line give rise to three 
lines meeting in a point and, conversely, three lines meet- 
ing 1n a point give rise to three points lying on a line and 
(2) if one starts with a point (or a line) and passes to 
the associated line (point) and then repeats the process, 
one returns to the original point (line). One way of using 
duality (presented by Poncelet) is to pick an arbitrary 
conic and then to associate with a point P lying outside 
the conic the line that joins the points R and S at which 


THE History OF MATHEMATICS 
ST i ~~ 


the tangents through P to the conic touch the conic. 
A second method is needed for points on or inside the 
conic. The feature of duality that makes it so exciting 
is that one can apply it mechanically to every proof in 
geometry, interchanging “point” and line” and “collin- 
ear” and “concurrent” throughout, and so obtain a new 
result. Sometimes a result turns out to be equivalent to 
the original, sometimes to its converse, but at a single 
stroke the number of theorems was more or less doubled. 


MAKING THE CALCULUS RIGOROUS 


Monge’s educational ideas were opposed by Joseph- 
Louis Lagrange, who favoured a more traditional and 
theoretical diet of advanced calculus and rational 
mechanics (the application of the calculus to the study of 
the motion of solids and liquids). Eventually Lagrange 
won, and the vision of mathematics that was presented 
to the world was that of an autonomous subject that was 
also applicable to a broad range of phenomena by virtue 
of its great generality, a view that has persisted to the 
present day. 

During the 1820s Augustin-Louis, Baron Cauchy, 
lectured at the Ecole Polytechnique on the foundations 
of the calculus. Since its invention it had been generally 
agreed that the calculus gave correct answers, but no 
one had been able to give a satisfactory explanation of 
why this was so. Cauchy rejected Lagrange’s algebraic 
approach and proved that Lagrange’s basic assumption 
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that every function has a power series expansion is in 
fact false. Newton had suggested a geometric or dynamic 
basis for calculus, but this ran the risk of introducing a 
vicious circle when the calculus was applied to mechani- 
cal or geometric problems. Cauchy proposed basing the 
calculus on a sophisticated and difficult interpretation 
of the idea of two points or numbers being arbitrarily 
close together. Although his students disliked the new 
approach, and Cauchy was ordered to teach material 
that the students could actually understand and use, his 
methods gradually became established and refined to 
form the core of the modern rigorous calculus, a subject 
now called mathematical analysis. 

Traditionally, the calculus had been concerned with 
the two processes of differentiation and integration and the 
reciprocal relation that exists between them. Cauchy pro- 
vided a novel underpinning by stressing the importance of 
the concept of continuity, which is more basic than either. 
He showed that, once the concepts of a continuous func- 
tion and limit are defined, the concepts of a differentiable 
function and an integrable function can be defined in terms 
of them. Unfortunately, neither of these concepts is easy to 
grasp, and the much-needed degree of precision they bring 
to mathematics has proved difficult to appreciate. Roughly 
speaking, a function is continuous at a point in its domain 
if small changes in the input around the specified value pro- 
duce only small changes in the output. 

Thus, the familiar graph of a parabola y = x is 
continuous around the point x = 0; as x varies by small 
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amounts, so necessarily does y. On the other hand, the 
graph of the function that takes the value 0 when x is 
negative or zero, and the value 1 when x is positive, 
plainly has a discontinuous graph at the point x = 0, and 
it is indeed discontinuous there according to the defin1- 
tion. If x varies from 0 by any small positive amount, 
the value of the function jumps by the fixed amount 1, 
which is not an arbitrarily small amount. 

Cauchy said that a function f(x) tends to a lim- 
iting value 1 as x tends to the value a whenever the 
value of the difference f(x) — f(a) becomes arbitrarily 
small as the difference x — a itself becomes arbitrarily 
small. He then showed that if f(x) is continuous at a, 
the limiting value of the function as x tended to a was 
indeed f(a). The crucial feature of this definition is 
that it defines what it means for a variable quantity to 
tend to something entirely without reference to ideas 
of motion. 

Cauchy then said a function f(x) is differentia- 
ble at the point a if, as x tends to a (which it is never 
allowed to reach), the value of the quotient [f/(x) — f(a)]/ 
(x — a) tends to a limiting value, called the derivative 
of the function f(x) at a. To define the integral of a 
function f(x) between the values a and b, Cauchy went 
back to the primitive idea of the integral as the mea- 
sure of the area under the graph of the function. He 
approximated this area by rectangles and said that if 
the sum of the areas of the rectangles tends to a limit as 
their number increases indefinitely and if this limiting 
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value is the same however the rectangles are obtained, 
then the function 1s integrable. Its integral is the com- 
mon limiting value. After he had defined the integral 
independently of the differential calculus, Cauchy had 
to prove that the processes of integrating and differen- 
tiating are mutually inverse. This he did, giving for the 
first time a rigorous foundation to all the elementary 
calculus of his day. 


FOURIER SERIES 


The other crucial figure of the time in France was 
Joseph, Baron Fourier. His major contribution, pre- 
sented in The Analytical Theory of Heat (1822), was 
to the theory of heat diffusion in solid bodies. He pro- 
posed that any function could be written as an infinite 
sum of the trigonometric functions cosine and sine; 
for example, 
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Expressions of this kind had been written down 
earlier, but Fourier’s treatment was new in the degree 
of attention given to their convergence. He investigated 
the question “Given the function f(x), for what range 
of values of x does the expression above sum to a finite 
number?” It turned out that the answer depends on the 
coefficients a", and Fourier gave rules for obtaining them 
of the form 
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it 
dy = f(x) sin(nx) ax. 

=e 
Had Fourier’s work been entirely correct, it would 
have brought all functions into the calculus, making pos- 
sible the solution of many kinds of differential equations 
and greatly extending the theory of mathematical phys- 
ics. But his arguments were unduly naive: after Cauchy 
it was not clear that the function f(x) sin (nx) was neces- 
sarily integrable. When Fourier’s ideas were finally pub- 
lished, they were eagerly taken up, but the more cautious 
mathematicians, notably the influential German Peter 
Gustav Lejeune Dirichlet, wanted to rederive Fourier’s 
conclusions in a more rigorous way. Fourier’s methodol- 
ogy was widely accepted, but questions about its validity 
in detail were to occupy mathematicians for the rest of 

the century. 


ELuiptic FUNCTIONS 


The theory of functions of a complex variable was also 
being decisively reformulated. At the start of the 19th 
century, complex numbers were discussed from a qua- 
si-philosophical standpoint by several French writers, 
notably Jean-Robert Argand. A consensus emerged that 
complex numbers should be thought of as pairs of real 
numbers, with suitable rules for their addition and mul- 
tiplication so that the pair (0, 1) was a square root of —1 
(i). The underlying meaning of such a number pair was 
given by its geometric interpretation either as a point in 
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a plane or as a directed segment joining the coordinate 
origin to the point in question. (This representation 1s 
sometimes called the Argand diagram.) In 1827, while 
revising an earlier manuscript for publication, Cauchy 
showed how the problem of integrating functions of two 
variables can be illuminated by a theory of functions of 
a single complex variable, which he was then developing. 
But the decisive influence on the growth of the subject 
came from the theory of elliptic functions. 

The study of elliptic functions originated in the 
18th century, when many authors studied integrals of 
the form 
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where p(t) and g(t) are polynomials in ¢ and gq(f) 1s of 
degree 3 or 4 in ¢. Such integrals arise naturally, for 
example, as an expression for the length of an arc of 
an ellipse (whence the name). These integrals cannot be 
evaluated explicitly; they do not define a function that 
can be obtained from the rational and trigonometric 
functions, a difficulty that added to their interest. Ellip- 
tic integrals were intensively studied for many years 
by the French mathematician Adrien-Marie Legendre, 
who was able to calculate tables of values for such 
expressions as functions of their upper endpoint, x. But 
the topic was completely transformed in the late 1820s 
by the independent but closely overlapping discoveries 
of two young mathematicians, the Norwegian Niels 
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Henrik Abel and the German Carl Jacobi. These men 
showed that if one allowed the variable x to be com- 
plex and the problem was inverted, so that the object of 
study became 


f p(t)dt 

u= : 

9 va(t) 

considered as defining a function x of a variable u, then a 
remarkable new theory became apparent. The new func- 
tion, for example, possessed a property that generalized 
the basic property of periodicity of the trigonometric 
functions sine and cosine: sin (x) = sin (x + 2m). Any 
function of the kind just described has two distinct peri- 
ods, w, and @,: 


xi) = x (ie + w,) = lie + wy}. 


These new functions, the elliptic functions, aroused 
a considerable degree of interest. The analogy with trig- 
onometric functions ran very deep (indeed, the trigono- 
metric functions turned out to be special cases of elliptic 
functions), but their greatest influence was on the bur- 
geoning general study of functions of a complex variable. 
The theory of elliptic functions became the paradigm 
of what could be discovered by allowing variables to be 
complex instead of real. But their natural generalization 
to functions defined by more complicated integrands, 
although it yielded partial results, resisted analysis until 
the second half of the 19th century. 
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THE THEORY OF NUMBERS 


While the theory of elliptic functions typifies the 19th 
century’s enthusiasm for pure mathematics, some con- 
temporary mathematicians said that the simultaneous 
developments in number theory carried that enthusi- 
asm to excess. Nonetheless, during the 19th century the 
algebraic theory of numbers grew from being a minority 
interest to its present central importance in pure math- 
ematics. The earlier investigations of Pierre de Fermat 
had eventually drawn the attention of Leonhard Euler 
and Lagrange. Euler proved some of Fermat’s unproven 
claims and discovered many new and surprising facts; 
Lagrange not only supplied proofs of many remarks 
that Euler had merely conjectured but also worked them 
into something like a coherent theory. For example, it 
was known to Fermat that the numbers that can be writ- 
ten as the sum of two squares are the number 2, squares 
themselves, primes of the form 4” + 1, and products of 
these numbers. Thus, 29, which is 4 x 7 + 1, is 5 + 2, but 
35, which is not of this form, cannot be written as the 
sum of two squares. Euler had proved this result and had 
gone on to consider similar cases, such as primes of the 
form x + 2y orx + 3y.. But it was left to Lagrange to 
provide a general theory covering all expressions of the 
form ax + bxy+ cy, quadratic forms, as they are called. 

Lagrange’s theory of quadratic forms had made 
considerable use of the idea that a given quadratic form 
could often be simplified to another with the same prop- 
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erties but with smaller coefficients. To do this in prac- 
tice, it was often necessary to consider whether a given 
integer left a remainder that was a square when it was 
divided by another given integer. (For example, 48 leaves 
a remainder of 4 upon division by 11, and 4 is a square.) 
Legendre discovered a remarkable connection between 
the question “Does the integer p leave a square remain- 
der on division by qg?” and the seemingly unrelated ques- 
tion “Does the integer g leave a square remainder upon 
division by p?” He saw, in fact, that when p and gq are 
primes, both questions have the same answer unless both 
primes are of the form 4 — 1. Because this observation 
connects two questions in which the integers p and q play 
mutually opposite roles, 1t became known as the law of 
quadratic reciprocity. Legendre also gave an effective way 
of extending his law to cases when p and g are not prime. 

All this work set the scene for the emergence of 
Carl Friedrich Gauss, whose Disquisitiones Arithmeti- 
cae (1801) not only consummated what had gone before 
but also directed number theorists in new and deeper 
directions. He rightly showed that Legendre’s proof 
of the law of quadratic reciprocity was fundamentally 
flawed and gave the first rigorous proof. His work sug- 
gested that there were profound connections between 
the original question and other branches of number 
theory, a fact that he perceived to be of signal impor- 
tance for the subject. He extended Lagrange’s theory of 
quadratic forms by showing how two quadratic forms 
can be “multiplied” to obtain a third. Later mathema- 


ticlans were to rework this into an important example 
of the theory of finite commutative groups. And in the 
long final section of his book, Gauss gave the theory that 
lay behind his first discovery as a mathematician: that a 
regular 17-sided figure can be constructed by circle and 
straightedge alone. 

The discovery that the regular “17-gon” is so con- 
structible was the first such discovery since the Greeks, who 
had known only of the equilateral triangle, the square, the 
regular = pentagon, 
the regular 15-sided 
figure, and the figures 
that can be obtained 
from these by suc- 
cessively bisecting all 
the sides. But what 
was of much greater 
significance than the 
discovery was the the- 
ory that underpinned 
it, the theory of what 
are now called alge- 
braic numbers. It 
may be thought of 
as an analysis of how 
complicated a num- 
ber may be while yet 
being amenable to an 
exact treatment. 
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The simplest numbers to understand and use are the 
integers and the rational numbers. The irrational numbers 
seem to pose problems. Famous among these is 2. It can- 
not be written as a finite or repeating decimal (because it 
is not rational), but it can be manipulated algebraically 
very easily. It is necessary only to replace every occurrence 
of (2) by 2. In this way expressions of the form m + ny2, 
where m and n are integers, can be handled arithmetically. 
These expressions have many properties akin to those of 
whole numbers, and mathematicians have even defined 
prime numbers of this form; therefore, they are called alge- 
braic integers. In this case they are obtained by grafting 
onto the rational numbers a solution of the polynomial 
equation x — 2 = 0. In general an algebraic integer is any 
solution, real or complex, of a polynomial equation with 
integer coefficients in which the coefficient of the highest 
power of the unknown is 1. 

Gauss’s theory of algebraic integers led to the ques- 
tion of determining when a polynomial of degree 1 with 
integer coefficients can be solved given the solvability of 
polynomial equations of lower degree but with coeffi- 
cients that are algebraic integers. For example, Gauss 
regarded the coordinates of the 17 vertices of a regular 
17-sided figure as complex numbers satisfying the equa- 
tion x — 1=0 and thus as algebraic integers. One such 
integer is 1. He showed that the rest are obtained by solv- 
ing a succession of four quadratic equations. Because 
solving a quadratic equation is equivalent to performing 
a construction with a ruler and a compass, as Descartes 
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had shown long before, Gauss had shown how to con- 
struct the regular 17-gon. 

Inspired by Gauss’s works on the theory of num- 
bers, a growing school of mathematicians were drawn 
to the subject. Like Gauss, the German mathematician 
Ernst Eduard Kummer sought to generalize the law of 
quadratic reciprocity to deal with questions about third, 
fourth, and higher powers of fnumbers. He found that his 
work led him in an unexpected direction, toward a partial 
resolution of Fermat’s last theorem. In 1637 Fermat wrote 
in the margin of his copy of Diophantus’s Arithmetica the 
claim to have a proof that there are no solutions in posi- 
tive integers to the equation x. + y =z ifn > 2. However, 
no proof was ever discovered among his notebooks. 

Kummer’s approach was to develop the theory of 
algebraic integers. If it could be shown that the equation 
had no solution in suitable algebraic integers, then a for- 
tiori there could be no solution in ordinary integers. He 
was eventually able to establish the truth of Fermat’s last 
theorem for a large class of prime exponents n (those 
satisfying some technical conditions needed to make the 
proof work). This was the first significant breakthrough 
in the study of the theorem. Together with the earlier 
work of the French mathematician Sophie Germain, it 
enabled mathematicians to establish Fermat’s last theo- 
rem for every value of n from 3 to 4,000,000. However, 
Kummer’s way around the difficulties he encountered 
further propelled the theory of algebraic integers into 
the realm of abstraction. It amounted to the suggestion 
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that there should be yet other types of integers, but many 
found these ideas obscure. 

In Germany Richard Dedekind patiently created 
a new approach, in which each new number (called an 
ideal) was defined by means of a suitable set of algebraic 
integers in such a way that it was the common divisor of 
the set of algebraic integers used to define it. Dedekind’s 
work was slow to gain approval, yet it illustrates several 
of the most profound features of modern mathematics. 
It was clear to Dedekind that the ideal algebraic inte- 
gers were the work of the human mind. Their existence 
can be neither based on nor deduced from the existence 
of physical objects, analogies with natural processes, or 
some process of abstraction from more familiar things. 
A second feature of Dedekind’s work was its reliance on 
the idea of sets of objects, such as sets of numbers, even 
sets of sets. Dedekind’s work showed how basic the naive 
conception of a set could be. The third crucial feature 
of his work was its emphasis on the structural aspects 
of algebra. The presentation of number theory as a the- 
ory about objects that can be manipulated (in this case, 
added and multiplied) according to certain rules akin to 
those governing ordinary numbers was to be a paradigm 
of the more formal theories of the 20th century. 


THE THEORY OF EQUATIONS 


Another subject that was transformed in the 19th century 
was the theory of equations. Ever since Niccolo Tartaglia 
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and Lodovico Ferrari in the 16th century found rules 
giving the solutions of cubic and quartic equations in 
terms of the coefficients of the equations, formulas had 
unsuccessfully been sought for equations of the fifth and 
higher degrees. At stake was the existence of a formula 
that expressed the roots of a quintic equation in terms of 
the coefficients. This formula, moreover, had to involve 
only the operations of addition, subtraction, multiplica- 
tion, and division, together with the extraction of roots, 
since that was all that had been required for the solution 
of quadratic, cubic, and quartic equations. If such a for- 
mula were to exist, the quintic would accordingly be said 
to be solvable by radicals. 

In 1770 Lagrange had analyzed all the successful 
methods he knew for second-, third-, and fourth-degree 
equations in an attempt to see why they worked and how 
they could be generalized. His analysis of the problem in 
terms of permutations of the roots was promising, but he 
became more and more doubtful as the years went by that 
his complicated line of attack could be carried through. 
The first valid proof that the general quintic is not solvable 
by radicals was offered only after his death, in a startlingly 
short paper by Niels Henrik Abel, written in 1824. 

Abel also showed by example that some quintic 
equations were solvable by radicals and that some equa- 
tions could be solved unexpectedly easily. For exam- 
ple, the equation x - 1 = 0 has one root x = 1, but the 
remaining four roots can be found just by extracting 
square roots, not fourth roots as might be expected. He 
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therefore raised the question “What equations of degree 
higher than four are solvable by radicals?” 

Abel died in 1829 at the age of 26 and did not resolve 
the problem he had posed. Almost at once, however, 
the astonishing prodigy Evariste Galois burst upon the 
Parisian mathematical scene. He submitted an account 
of his novel theory of equations to the Academy of 
Sciences in 1829, but the manuscript was lost. A second 
version was also lost and was not found among Fourier’s 
papers when Fourier, the secretary of the academy, died 
in 1830. Galois was killed in a duel in 1832, at the age 
of 20, and it was not until his papers were published in 
Joseph Liouville’s Journal de mathématiques in 1846 that 
his work began to receive the attention it deserved. His 
theory eventually made the theory of equations into a 
mere part of the theory of groups. Galois emphasized 
the group (as he called it) of permutations of the roots 
of an equation. This move took him away from the 
equations themselves and turned him instead toward 
the markedly more tractable study of permutations. To 
any given equation there corresponds a definite group, 
with a definite collection of subgroups. To explain which 
equations were solvable by radicals and which were not, 
Galois analyzed the ways in which these subgroups were 
related to one another: solvable equations gave rise to 
what are now called a chain of normal subgroups with 
cyclic quotients. This technical condition makes it clear 
how far mathematicians had gone from the familiar 
questions of 18th-century mathematics, and it marks 
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a transition characteristic of modern mathematics: the 
replacement of formal calculation by conceptual analy- 
sis. This is a luxury available to the pure mathematician 
that the applied mathematician faced with a concrete 
problem cannot always afford. 

According to this theory, a group 1s a set of objects 
that one can combine in pairs in such a way that the 
resulting object is also in the set. Moreover, this way of 
combination has to obey the following rules (here objects 
in the group are denoted a, b, etc., and the combination 


of a and bis written a * b): 


1. There is an element e such that a* e=a=e* a for 
every element ain the group. This element is called 
the identity element of the group. 


2. For every element a there is an element, written a, 
with the property that a * a!'=e=a'!*a. The 
element a‘! is called the inverse of a. 

3. For every a, b, and c in the group the associative 
law holds: (a * b) * c =a * (b* c). 

Examples of groups include the integers with * 
interpreted as addition and the positive rational num- 
bers with * interpreted as multiplication. An important 
property shared by some groups but not all is com- 
mutativity: for every element a and b,a* b=b* a. 
The rotations of an object in the plane around a fixed 
point form a commutative group, but the rotations of 
a three-dimensional object around a fixed point form a 
noncommutative group. 
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Gauss 


A convenient way to assess the situation in mathematics 
in the mid-19th century is to look at the career of its 
greatest exponent, Carl Friedrich Gauss, the last man 
to be called the “Prince of Mathematics.” In 1801, the 
same year in which he published his Disquisitiones Arith- 
meticae, he rediscovered the asteroid Ceres (which had 
disappeared behind the Sun not long after it was first 
discovered and before its orbit was precisely known). He 
was the first to give a sound analysis of the method of 
least squares in the analysis of statistical data. Gauss 
did important work in potential theory and, with the 
German physicist Wilhelm Weber, built the first electric 
telegraph. He helped conduct the first survey of Earth’s 
magnetic field and did both theoretical and field work 
in cartography and surveying. He was a polymath who 
almost single-handedly embraced what elsewhere was 
being put asunder: the world of science and the world 
of mathematics. It is his purely mathematical work, 
however, that in its day was—and ever since has been— 
regarded as the best evidence of his genius. 

Gauss’s writings transformed the theory of num- 
bers. His theory of algebraic integers lay close to the 
theory of equations as Galois was to redefine it. More 
remarkable are his extensive writings, dating from 
1797 to the 1820s but unpublished at his death, on the 
theory of elliptic functions. In 1827 he published his 
crucial discovery that the curvature of a surface can be 
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defined intrinsically—that is, solely in terms of prop- 
erties defined within the surface and without reference 
to the surrounding Euclidean space. This result was to 
be decisive in the acceptance of non-Euclidean geom- 
etry. All of Gauss’s work displays a sharp concern for 
rigour and a refusal to rely on intuition or physical 
analogy, which was to serve as an inspiration to his 
successors. His emphasis on achieving full conceptual 
understanding, which may have led to his dislike of 
publication, was by no means the least influential of 
his achievements. 


Non-EucLipDEAN GEOMETRY 


Perhaps it was this desire for conceptual understanding 
that made Gauss reluctant to publish the fact that he was 
led more and more “to doubt the truth of geometry,” as 
he put it. For if there was a logically consistent geometry 
differing from Euclid’s only because it made a different 
assumption about the behaviour of parallel lines, it too 
could apply to physical space, and so the truth of (Euclid- 
ean) geometry could no longer be assured a priori, as 
Immanuel Kant had thought. 

Gauss’s investigations into the new geometry went 
farther than anyone else’s before him, but he did not pub- 
lish them. The honour of being the first to proclaim the 
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existence of a new geometry belongs to two others, who 
did so in the late 1820s: Nicolay Ivanovich Lobachevsky 
in Russia and Janos Bolyai in Hungary. Because the sim- 
ilarities in the work of these two men far exceed the dif- 
ferences, it is convenient to describe their work together. 

Both men made an assumption about parallel lines 
that differed from Euclid’s and proceeded to draw out its 
consequences. This way of working cannot guarantee 
the consistency of one’s findings, so, strictly speaking, 
they could not prove the existence of a new geometry in 
this way. Both men described a three-dimensional space 
different from Euclidean space by couching their find- 
ings in the language of trigonometry. The formulas they 
obtained were exact analogs of the formulas that describe 
triangles drawn on the surface of a sphere, with the usual 
trigonometric functions replaced by those of hyperbolic 
trigonometry. The functions hyperbolic cosine, written 
cosh, and hyperbolic sine, written sinh, are defined as 
follows: cosh x = (e + e )/2, and sinh x = (e - e )/2. 
They are called hyperbolic because of their use in describ- 
ing the hyperbola. Their names derive from the evident 
analogy with the trigonometric functions, which Euler 
showed satisfy these equations: cos x = (e +e )/2, and 
sin x =(e -e )/2i. The formulas were what gave the work 
of Lobachevsky and of Bolyai the precision needed to give 
conviction in the absence of a sound logical structure. Both 
men observed that it had become an empirical matter to 
determine the nature of space, Lobachevsky even going 
so far as to conduct astronomical observations, although 
these proved inconclusive. 
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The work of Bolyai and of Lobachevsky was poorly 
received. Gauss endorsed what they had done, but so 
discreetly that most mathematicians did not find out his 
true opinion on the subject until he was dead. The main 
obstacle each man faced was surely the shocking nature 
of their discovery. It was easier, and in keeping with 2,000 
years of tradition, to continue to believe that Euclidean 
geometry was correct and that Bolyai and Lobachevsky 
had somewhere gone astray, like many an investigator 
before them. 

The turn toward acceptance came in the 1860s, 
after Bolyai and Lobachevsky had died. The Italian 
mathematician Eugenio Beltrami decided to investigate 
Lobachevsky’s work and to place it, if possible, within the 
context of differential geometry as redefined by Gauss. 
He therefore moved independently in the direction already 
taken by Bernhard Riemann. Beltrami investigated the 
surface of constant negative curvature and found that on 
such a surface triangles obeyed the formulas of hyper- 
bolic trigonometry that Lobachevsky had discovered were 
appropriate to his form of non-Euclidean geometry. Thus, 
Beltrami gave the first rigorous description of a geome- 
try other than Euclid’s. Beltrami’s account of the surface 
of constant negative curvature was ingenious. He said it 
was an abstract surface that he could describe by drawing 
maps of it, much as one might describe a sphere by means 
of the pages of a geographic atlas. He did not claim to have 
constructed the surface embedded in Euclidean two-di- 
mensional space; David Hilbert later showed that it cannot 
be done. 
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RIEMANN 


When Gauss died in 1855, his post at Gottingen was 
taken by Peter Gustav Lejeune Dirichlet. One mathe- 
matician who found the presence of Dirichlet a stimulus 
to research was Bernhard Riemann, and his few short 
contributions to mathematics were among the most 
influential of the century. Riemann’s first paper, his doc- 
toral thesis (1851) on the theory of complex functions, 
provided the foundations for a geometric treatment of 
functions of a complex variable. His main result guar- 
anteed the existence of a wide class of complex func- 
tions satisfying only modest general requirements and so 
made it clear that complex functions could be expected 
to occur widely in mathematics. More important, Rie- 
mann achieved this result by yoking together the theory 
of complex functions with the theory of harmonic func- 
tions and with potential theory. The theories of complex 
and harmonic functions were henceforth inseparable. 

Riemann then wrote on the theory of Fourier series 
and their integrability. His paper was directly in the tra- 
dition that ran from Cauchy and Fourier to Dirichlet, 
and it marked a considerable step forward in the preci- 
sion with which the concept of integral can be defined. 
In 1854 he took up a subject that much interested Gauss, 
the hypotheses lying at the basis of geometry. 

The study of geometry has always been one of the 
central concerns of mathematicians. It was the language, 
and the principal subject matter, of Greek mathematics, 
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was the mainstay of elementary education in the subject, 
and has an obvious visual appeal. It seems easy to apply, 
for onecan proceed froma base of naively intelligible con- 
cepts. In keeping with the general trends of the century, 
however, it was just the naive concepts that Riemann chose 
to refine. What he proposed as the basis of geometry was 
far more radical and fundamental than anything that had 
gone before. 

Riemann took his inspiration from Gauss’s discov- 
ery that the curvature of a surface is intrinsic, and he 
argued that one should therefore ignore Euclidean space 
and treat each surface by itself. A geometric property, he 
argued, was one that was intrinsic to the surface. To do 
geometry, it was enough to be given a set of points and 
a way of measuring lengths along curves in the surface. 
For this, traditional ways of applying the calculus to the 
study of curves could be made to suffice. But Riemann 
did not stop with surfaces. He proposed that geometers 
study spaces of any dimension in this spirit—even, he 
said, spaces of infinite dimension. 

Several profound consequences followed from 
this view. It dethroned Euclidean geometry, which now 
became just one of many geometries. It allowed the 
geometry of Bolyai and Lobachevsky to be recognized 
as the geometry of a surface of constant negative cur- 
vature, thus resolving doubts about the logical consis- 
tency of their work. It highlighted the importance of 
intrinsic concepts in geometry. It helped open the way 
to the study of spaces of many dimensions. Last but not 
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least, Riemann’s work ensured that any investigation of 
the geometric nature of physical space would thereafter 
have to be partly empirical. One could no longer say that 
physical space is Euclidean because there is no geometry 
but Euclid’s. This realization finally destroyed any hope 
that questions about the world could be answered by a 
priori reasoning. 

In 1857 Riemann published several papers applying 
his very general methods for the study of complex func- 
tions to various parts of mathematics. One of these papers 
solved the outstanding problem of extending the theory 
of elliptic functions to the integration of any algebraic 
function. It opened up the theory of complex functions 
of several variables and showed how Riemann’s novel 
topological ideas were essential in the study of complex 
functions. (In subsequent lectures Riemann showed how 
the special case of the theory of elliptic functions could 
be regarded as the study of complex functions on a torus.) 

In another paper Riemann dealt with the question 
of how many prime numbers are less than any given 
number x. The answer is a function of x, and Gauss had 
conjectured on the basis of extensive numerical evidence 
that this function was approximately x/In(x). This turned 
out to be true, but it was not proved until 1896, when 
both Charles-Jean de la Vallée Poussin of Belgium and 
Jacques-Salomon Hadamard of France independently 
proved it. It is remarkable that a question about integers 
led to a discussion of functions of a complex variable, 
but similar connections had previously been made by 


EUROPEAN MATHEMATICS SINCE THE MIDDLE AGES 
a 


Dirichlet. Riemann took the expression [[(l — p') = 
Yn ’, introduced by Euler the century before, where the 
infinite product is taken over all prime numbers p and 
the sum over all whole numbers n, and treated it as a 
function of s. The infinite sum makes sense whenever s is 
real and greater than 1. Riemann proceeded to study this 
function when s is complex (now called the Riemann 
zeta function), and he thereby not only helped clarify the 
question of the distribution of primes but also was led to 
several other remarks that later mathematicians were to 
find of exceptional interest. One remark has continued 
to elude proof and remains one of the greatest conjec- 
tures in mathematics: the claim that the nonreal zeros of 
the zeta function are complex numbers whose real part 
is always equal to 1/2. 


RIEMANN’S INFLUENCE 


In 1859 Dirichlet died and Riemann became a full pro- 
fessor, but he was already ill with tuberculosis, and in 
1862 his health broke. He died in 1866. His work, how- 
ever, exercised a growing influence on his successors. His 
work on trigonometric series, for example, led to a deep- 
ening investigation of the question of when a function is 
integrable. Attention was concentrated on the nature of 
the sets of points at which functions and their integrals 
(when these existed) had unexpected properties. The con- 
clusions that emerged were at first obscure, but it became 
clear that some properties of point sets were important in 
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the theory of integration, while others were not. (These 
other properties proved to be a vital part of the emerging 
subject of topology.) The properties of point sets that 
matter in integration have to do with the size of the set. 
If one can change the values of a function on a set of 
points without changing its integral, it is said that the set 
is of negligible size. The naive idea is that integrating is 
a generalization of counting: negligible sets do not need 
to be counted. About the turn of the century the French 
mathematician Henri-Léon Lebesgue managed to sys- 
tematize this naive idea into a new theory about the size 
of sets, which included integration as a special case. In 
this theory, called measure theory, there are sets that can 
be measured, and they either have positive measure or 
are negligible (they have zero measure), and there are 
sets that cannot be measured at all. 

The first success for Lebesgue’s theory was that, 
unlike the Cauchy-Riemann integral, it obeyed the 
rule that if a sequence of functions f,(x) tends suitably 
to a function f(x), then the sequence of integrals [fn(x) 
dx tends to the integral [f(x)dx. This has made it the nat- 
ural theory of the integral when dealing with questions 
about trigonometric series. Another advantage 1s that it 
is very general. For example, in probability theory it is 
desirable to estimate the likelihood of certain outcomes 
of an experiment. By imposing a measure on the space 
of all possible outcomes, the Russian mathematician 
Andrey Kolmogorov was the first to put probability the- 
ory on a rigorous mathematical footing. 
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Another example is provided by a remarkable result 
discovered by the 20th-century American mathematician 
Norbert Wiener: within the set of all continuous functions 
on an interval, the set of differentiable functions has mea- 
sure zero. In probabilistic terms, therefore, the chance that 
a function taken at random is differentiable has probabil- 
ity zero. In physical terms, this means that, for example, a 
particle moving under Brownian motion almost certainly 
is Moving on a nondifferentiable path. This discovery clar- 
ified Albert Einstein’s fundamental ideas about Brownian 


motion (displayed by 


the continual motion of ff 


specks of dust in a fluid 
under the constant bom- 
bardment of surrounding 
molecules). The hope of 
physicists is that Rich- 
ard Feynman’s theory of 
quantum electrodynam- 
ics will yield to a similar 
measure-theoretic  treat- 
ment, for it has the dis- 
turbing aspect of a theory 
that has not been made 
rigorous mathematically 
but accords excellently 
with observation. 

Yet another setting 
for Lebesgue’s ideas was 


Norbert Wiener 
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to be the theory of Lie groups. The Hungarian mathe- 
matician Alfréd Haar showed how to define the concept 
of measure so that functions defined on Lie groups could 
be integrated. This became a crucial part of Hermann 
Weyl’s way of representing a Lie group as acting linearly 
on the space of all (suitable) functions on the group (for 
technical reasons, suitable means that the square of the 
function is integrable with respect to a Haar measure on 
the group). 


DIFFERENTIAL EQUATIONS 


Another field that developed considerably in the 19th 
century was the theory of differential equations. The 
pioneer in this direction once again was Cauchy. Above 
all, he insisted that one should prove that solutions do 
indeed exist; it is not a priori obvious that every ordinary 
differential equation has solutions. The methods that 
Cauchy proposed for these problems fitted naturally into 
his program of providing rigorous foundations for all the 
calculus. The solution method he preferred, although the 
less-general of his two approaches, worked equally well 
in the real and complex cases. It established the existence 
of a solution equal to the one obtainable by traditional 
power series methods by using newly developed tech- 
niques in his theory of functions of a complex variable. 
The harder part of the theory of differential equations 
concerns partial differential equations, those for which the 
unknown function is a function of several variables. In the 
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early 19th century there was no known method of proving 
that a given second- or higher-order partial differential 
equation had a solution, and there was not even a method 
of writing down a plausible candidate. In this case prog- 
ress was to be much less marked. Cauchy found new and 
more rigorous methods for first-order partial differential 
equations, but the general case eluded treatment. 

An important special case was successfully pros- 
ecuted, that of dynamics. Dynamics is the study of the 
motion of a physical system under the action of forces. 
Working independently of each other, William Rowan 
Hamilton in Ireland and Carl Jacobi in Germany showed 
how problems in dynamics could be reduced to systems 
of first-order partial differential equations. From this 
base grew an extensive study of certain partial differential 
operators. These are straightforward generalizations of a 
single partial differentiation (d/dx) to a sum of the form 

a i a + A, a ; 
Ox, Ox, 
where the a’s are functions of the x’s. The effect of perform- 
ing several of these in succession can be complicated, but 
Jacobi and the other pioneers in this field found that there 
are formal rules that such operators tend to satisfy. This 
enabled them to shift attention to these formal rules, and 
gradually an algebraic analysis of this branch of mathe- 
matics began to emerge. 

The most influential worker 1n this direction was the 

Norwegian Sophus Lie. Lie, and independently Wilhelm 
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Killing in Germany, came to suspect that the systems of 
partial differential operators they were studying came in 
a limited variety of types. Once the number of indepen- 
dent variables was specified (which fixed the dimension of 
the system), a large class of examples, including many of 
considerable geometric significance, seemed to fall into a 
small number of patterns. This suggested that the systems 
could be classified, and such a prospect naturally excited 
mathematicians. After much work by Lie and by Killing 
and later by the French mathematician Elie-Joseph Cartan, 
they were classified. Initially, this discovery aroused interest 
because it produced order where previously the complex- 
ity had threatened chaos and because it could be made to 
make sense geometrically. The realization that there were to 
be major implications of this work for the study of physics 
lay well in the future. 


LINEAR ALGEBRA 


Differential equations, whether ordinary or partial, may 
profitably be classified as linear or nonlinear; linear differ- 
ential equations are those for which the sum of two solu- 
tions is again a solution. The equation giving the shape 
of a vibrating string 1s linear, which provides the mathe- 
matical reason for why a string may simultaneously emit 
more than one frequency. The linearity of an equation 
makes it easy to find all its solutions, so in general linear 
problems have been tackled successfully, while nonlinear 
equations continue to be difficult. Indeed, in many linear 
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problems there can be found a finite family of solutions 
with the property that any solution is a sum of them 
(suitably multiplied by arbitrary constants). Obtaining 
such a family, called a basis, and putting them into their 
simplest and most useful form, was an important source 
of many techniques in the field of linear algebra. 

Consider, for example, the system of linear differen- 
tial equations 


dy, dy 
—=ayt+ by, ==cy,+ 
ce ay, + by2 rs cy, + dyr 


It is evidently much more difficult to study than the 
system dy/dx = ay,, dydx = By,, whose solutions are 
(constant multiples of) y, = exp (ax) and y, = exp (Bx). 
But if a suitable linear combination of y, and y, can be 
found so that the first system reduces to the second, then 
it is enough to solve the second system. The existence 
of such a reduction is determined by an array of the 
four numbers 


which is called a matrix. In 1858 the English mathemati- 
cian Arthur Cayley began the study of matrices in their 
own right when he noticed that they satisfy polynomial 
equations. The matrix 
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for example, satisfies the equation A - (a + dA + 
(ad — bc) = 0. Moreover, if this equation has two dis- 
tinct roots—say, a and B—then the sought-for reduction 
will exist, and the coefficients of the simpler system 
will indeed be those roots a and £. If the equation has 
a repeated root, then the reduction usually cannot be 
carried out. In either case the difficult part of solving 
the original differential equation has been reduced to 
elementary algebra. 

The study of linear algebra begun by Cayley and 
continued by Leopold Kronecker includes a powerful 
theory of vector spaces. These are sets whose elements 
can be added together and multiplied by arbitrary 
numbers, such as the family of solutions of a linear dif- 
ferential equation. A more familiar example is that of 
three-dimensional space. If one picks an origin, then 
every point in space can be labeled by the line segment 
(called a vector) joining it to the origin. Matrices appear 
as ways of representing linear transformations of a vec- 
tor space—.e., transformations that preserve sums and 
multiplication by numbers: the transformation T is lin- 
ear if, for any vectors u, v, 7(u+ v) = 7(u) + 7(v) and, 
for any scalar A, T;(Av) = A7(v). When the vector space 
is finite-dimensional, linear algebra and geometry form 
a potent combination. Vector spaces of infinite dimen- 
sions also are studied. 

The theory of vector spaces is useful in other ways. 
Vectors in three-dimensional space represent such phys- 
ically important concepts as velocities and forces. Such 
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an assignment of vector to point is called a vector field; 
examples include electric and magnetic fields. Scientists 
such as James Clerk Maxwell and J. Willard Gibbs took 
up vector analysis and were able to extend vector methods 
to the calculus. They introduced in this way measures of 
how a vector field varies infinitesimally, which, under the 
names div, grad, and curl, have become the standard tools 
in the study of electromagnetism and potential theory. To 
the modern mathematician, div, grad, and curl form part 
of a theory to which Stokes’s law (a special case of which 
is Green’s theorem) is central. The Gauss-Green-Stokes 
theorem, named after Gauss and two leading English 
applied mathematicians of the 19th century (George 
Stokes and George Green), generalizes the fundamental 
theorem of the calculus to functions of several variables. 
The fundamental theorem of calculus asserts that 


b 
/ ftx)=f(b) - fla) 


which can be read as saying that the integral of the 
derivative of some function in an interval is equal to the 
difference in the values of the function at the endpoints 
of the interval. Generalized to a part of a surface or 
space, this asserts that the integral of the derivative of 
some function over a region is equal to the integral of 
the function over the boundary of the region. In symbols 
this says that Jdw = ie where the first integral is taken 
over the region in question and the second integral over 
its boundary, while do is the derivative of o. 
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THE FOUNDATIONS OF GEOMETRY 


By the late 19th century the hegemony of Euclidean 
geometry had been challenged by non-Euclidean geom- 
etry and projective geometry. The first notable attempt 
to reorganize the study of geometry was made by the 
German mathematician Felix Klein and published 
at Erlangen in 1872. In his Erlanger Programm Klein 
proposed that Euclidean and non-Euclidean geometry 
be regarded as special cases of projective geometry. In 
each case the common features that, in Klein’s opinion, 
made them geometries were that there were a set of 
points, called a “space,” and a group of transformations 
by means of which figures could be moved around in 
the space without altering their essential properties. For 
example, in Euclidean plane geometry the space is the 
familiar plane, and the transformations are rotations, 
reflections, translations, and their composites, none of 
which change either length or angle, the basic properties 
of figures in Euclidean geometry. Different geometries 
would have different spaces and different groups, and the 
figures would have different basic properties. 

Klein produced an account that unified a large class 
of geometries—roughly speaking, all those that were 
homogeneous in the sense that every piece of the space 
looked like every other piece of the space. This excluded, 
for example, geometries on surfaces of variable curva- 
ture, but it produced an attractive package for the rest 
and gratified the intuition of those who felt that some- 
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how projective geometry was basic. It continued to look 
like the right approach when Lie’s ideas appeared, and 
there seemed to be a good connection between Lie’s clas- 
sification and the types of geometry organized by Klein. 

Mathematicians could now ask why they had 
believed Euclidean geometry to be the only one when, in 
fact, many different geometries existed. The first to take 
up this question successfully was the German mathema- 
tician Moritz Pasch, who argued in 1882 that the mis- 
take had been to rely too heavily on physical intuition. 
In his view an argument in mathematics should depend 
for its validity not on the physical interpretation of the 
terms involved but upon purely formal criteria. Indeed, 
the principle of duality did violence to the sense of 
geometry as a formalization of what one believed about 
(physical) points and lines; one did not believe that these 
terms were interchangeable. 

The ideas of Pasch caught the attention of the 
German mathematician David Hilbert, who, with the 
French mathematician Henri Poincaré, came to domi- 
nate mathematics at the beginning of the 20th century. 
In wondering why it was that mathematics—and 1n par- 
ticular geometry—produced correct results, he came to 
feel increasingly that it was not because of the lucidity 
of its definitions. Rather, mathematics worked because 
its (elementary) terms were meaningless. What kept it 
heading in the right direction was its rules of inference. 
Proofs were valid because they were constructed through 
the application of the rules of inference, according to 
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which new assertions could be declared to be true simply 
because they could be derived, by means of these rules, 
from the axioms or previously proven theorems. The the- 
orems and axioms were viewed as formal statements that 
expressed the relationships between these terms. 

The rules governing the use of mathematical terms 
were arbitrary, Hilbert argued, and each mathematician 
could choose them at will, provided only that the choices 
made were self-consistent. A mathematician produced 
abstract systems unconstrained by the needs of science, 
and if scientists found an abstract system that fit one of 
their concerns, they could apply the system secure in the 
knowledge that it was logically consistent. 

Hilbert first became excited about this point of view 
(presented in his Grundlagen der Geometrie [1899; “Foun- 
dations of Geometry”) when he saw that it led not merely 
to a clear way of sorting out the geometries in Klein’s 
hierarchy according to the different axiom systems they 
obeyed but to new geometries as well. For the first time 
there was a way of discussing geometry that lay beyond 
even the very general terms proposed by Riemann. Not all 
of these geometries have continued to be of interest, but 
the general moral that Hilbert first drew for geometry he 
was shortly to draw for the whole of mathematics. 


THE FOUNDATIONS OF MATHEMATICS 


By the late 19th century the debates about the founda- 
tions of geometry had become the focus for a running 
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debate about the nature of the branches of mathematics. 
Cauchy’s work on the foundations of the calculus, com- 
pleted by the German mathematician Karl Weierstrass 
in the late 1870s, left an edifice that rested on concepts 
such as that of the natural numbers (the integers 1, 2, 3, 
and so on) and on certain constructions involving them. 
The algebraic theory of numbers and the transformed 
theory of equations had focused attention on abstract 
structures in mathematics. Questions that had been 
raised about numbers since Babylonian times turned out 
to be best cast theoretically in terms of entirely modern 
creations whose independence from the physical world 
was beyond dispute. Finally, geometry, far from being a 
kind of abstract physics, was now seen as dealing with 
meaningless terms obeying arbitrary systems of rules. 
Although there had been no conscious plan leading in 
that direction, the stage was set for a consideration of 
questions about the fundamental nature of mathematics. 

Similar currents were at work in the study of logic, 
which had also enjoyed a revival during the 19th century. 
The work of the English mathematician George Boole 
and the American Charles Sanders Peirce had contrib- 
uted to the development of a symbolism adequate to 
explore all elementary logical deductions. Significantly, 
Boole’s book on the subject was called An Investiga- 
tion of the Laws of Thought, on Which Are Founded the 
Mathematical Theories of Logic and Probabilities (1854). 
In Germany the logician Gottlob Frege had directed 
keen attention to such fundamental questions as what it 
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means to define something and what sorts of purported 
definitions actually do define. 


MATHEMATICS IN THE ZOTH AND 
21st CENTURIES 


CANTOR 


All of these debates came together through the pioneer- 
ing work of the German mathematician Georg Can- 
tor on the concept of a set. Cantor had begun work in 
this area because of his interest in Riemann’s theory of 
trigonometric series, but the problem of what character- 
ized the set of all real numbers came to occupy him more 
and more. He began to discover unexpected properties of 
sets. For example, he could show that the set of all alge- 
braic numbers, and a fortiori the set of all rational num- 
bers, is countable in the sense that there is a one-to-one 
correspondence between the integers and the members 
of each of these sets by means of which for any member 
of the set of algebraic numbers (or rationals), no mat- 
ter how large, there is always a unique integer it may be 
placed in correspondence with. But, more surprisingly, 
he could also show that the set of all real numbers is not 
countable. So, although the set of all integers and the 
set of all real numbers are both infinite, the set of all 
real numbers 1s a strictly larger infinity. This was in com- 
plete contrast to the prevailing orthodoxy, which pro- 
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claimed that infinite could mean only “larger than any 
finite amount.” 

Here the concept of number was being extended 
and undermined at the same time. The concept was 
extended because it was now possible to count and order 
sets that the set of integers was too small to measure, 
and it was undermined because even the integers ceased 
to be basic undefined objects. Cantor himself had given 
a way of defining real numbers as certain infinite sets 
of rational numbers. Rational numbers were easy to 
define in terms of the integers, but now integers could 
be defined by means of sets. One way was given by Frege 
in Die Grundlagen der Arithmetik (1884; The Founda- 
tions of Arithmetic). He regarded two sets as the same 
if they contained the same elements. So in his opinion 
there was only one empty set (today symbolized by @), 
the set with no members. A second set could be defined 
as having only one element by letting that element be 
the empty set itself (symbolized by {@}), a set with two 
elements by letting them be the two sets just defined (1.e., 
{O, {O}+), and so on. Having thus defined the integers 
in terms of the primitive concepts “set” and “element 
of,” Frege agreed with Cantor that there was no log- 
ical reason to stop, and he went on to define infinite 
sets in the same way Cantor had. Indeed, Frege was 
clearer than Cantor about what sets and their elements 
actually were. 

Frege’s proposals went in the direction of a reduction 
of all mathematics to logic. He hoped that every mathe- 
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matical term could be defined precisely and manipulated 
according to agreed, logical rules of inference. This, the 
“logicist” program, was dealt an unexpected blow in 1902 
by the English mathematician and philosopher Bertrand 
Russell, who pointed out unexpected complications with 
the naive concept of a set. Nothing seemed to preclude the 
possibility that some sets were elements of themselves while 
others were not, but, asked Russell, ““What then of the set of 
all sets that were not elements of themselves?” If it is an ele- 
ment of itself, then it is not (an element of itself), but, if it 1s 
not, then it is. Russell had identified a fundamental problem 
in set theory with his paradox. Either the idea of a set as an 
arbitrary collection of already defined objects was flawed, 
or else the idea that one could legitimately form the set of 
all sets of a given kind was incorrect. Frege’s program never 
recovered from this blow, and Russell’s similar approach 
of defining mathematics in terms of logic, which he devel- 
oped together with Alfred North Whitehead in their Prin- 
cipia Mathematica (1910-13), never found lasting appeal 
with mathematicians. 

Greater interest attached to the ideas that Hilbert 
and his school began to advance. It seemed to them that 
what had worked once for geometry could work again 
for all of mathematics. Rather than attempt to define 
things so that problems could not arise, they suggested 
that it was possible to dispense with definitions and cast 
all of mathematics in an axiomatic structure using the 
ideas of set theory. Indeed, the hope was that the study 
of logic could be embraced in this spirit, thus making 
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logic a branch of mathematics, the opposite of Frege’s 
intention. There was considerable progress in this direc- 
tion, and there emerged both a powerful school of math- 
ematical logicians (notably in Poland) and an axiomatic 
theory of sets that avoided Russell’s paradoxes and the 
others that had sprung up. 

In the 1920s Hilbert put forward his most detailed 
proposal for establishing the validity of mathematics. 
According to his theory of proofs, everything was to be 
put into an axiomatic form, allowing the rules of infer- 
ence to be only those of elementary logic, and only those 
conclusions that could be reached from this finite set of 
axioms and rules of inference were to be admitted. He 
proposed that a satisfactory system would be one that 
was consistent, complete, and decidable. By “consistent” 
Hilbert meant that it should be impossible to derive both 
a statement and its negation; by “complete,” that every 
properly written statement should be such that either it 
or its negation was derivable from the axioms; by “decid- 
able,” that one should have an algorithm that determines 
of any given statement whether it or its negation is prov- 
able. Such systems did exist—for example, the first-order 
predicate calculus—but none had been found capable of 
allowing mathematicians to do interesting mathematics. 

Hilbert’s program, however, did not last long. In 
1931 the Austrian-born American mathematician and 
logician Kurt Godel showed that there was no system 
of Hilbert’s type within which the integers could be 
defined and that was both consistent and complete. 
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Independently, Godel, the English mathematician Alan 
Turing, and the American logician Alonzo Church later 
showed that decidability was also unattainable. Perhaps 
paradoxically, the effect of this dramatic discovery was to 
alienate mathematicians from the whole debate. Instead, 
mathematicians, who may not have been too unhappy 
with the idea that there is no way of deciding the truth of 
a proposition automatically, learned to live with the idea 
that not even mathematics rests on rigorous foundations. 
Progress since has been in other directions. An alterna- 
tive axiom system for set theory was later put forward by 
the Hungarian-born American mathematician John von 
Neumann, which he hoped would help resolve contem- 
porary problems in quantum mechanics. There was also 
a renewal of interest in statements that are both inter- 
esting mathematically and independent of the axiom 
system in use. The first of these was the American math- 
ematician Paul Cohen’s surprising resolution in 1963 of 
the continuum hypothesis, which was Cantor’s conjecture 
that the set of all subsets of the rational numbers was of 
the same size as the set of all real numbers. This turns out 
to be independent of the usual axioms for set theory, so 
there are set theories (and therefore types of mathematics) 
in which it is true and others in which it is false. 


MATHEMATICAL Puysics 


At the same time that mathematicians were attempting to 
put their own house in order, they were also looking with 


EUROPEAN MATHEMATICS SINCE THE MIDDLE AGES 
a 


renewed interest at contemporary work in physics. The 
man who did the most to rekindle their interest was Poin- 
caré. Poincaré showed that dynamic systems described 
by quite simple differential equations, such as the solar 
system, can nonetheless yield the most random-looking, 
chaotic behaviour. He went on to explore ways in which 
mathematicians can nonetheless say things about this 
chaotic behaviour and so pioneered the way in which 
probabilistic statements about dynamic systems can be 
found to describe what otherwise defies intelligence. 
Poincaré later turned to problems of electrodynam- 
ics. After many years’ work, the Dutch physicist Hendrik 
Antoon Lorentz had been led to an apparent dependence 
of length and time on motion, and Poincaré was pleased 
to notice that the transformations that Lorentz proposed 
as a way of converting one observer’s data into another’s 
formed a group. This appealed to Poincaré and strength- 
ened his belief that there was no sense in a concept 
of absolute motion; all motion was relative. Poincaré 
thereupon gave an elegant mathematical formulation of 
Lorentz’s ideas, which fitted them into a theory in which 
the motion of the electron is governed by Maxwell’s 
equations. Poincaré, however, stopped short of denying 
the reality of the ether or of proclaiming that the velocity 
of light is the same for all observers, so credit for the first 
truly relativistic theory of the motion of the electron rests 
with Einstein and his special theory of relativity (1905). 
Einstein’s special theory is so called because it 
treats only the special case of uniform relative motion. 
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The much more important case of accelerated motion 
and motion in a gravitational field was to take a further 
decade and to require a far more substantial dose of 
mathematics. Einstein changed his estimate of the value 
of pure mathematics, which he had hitherto disdained, 
only when he discovered that many of the questions he 
was led to had already been formulated mathematically 
and had been solved. He was most struck by theories 
derived from the study of geometry in the sense in which 
Riemann had formulated it. 

By 1915 a number of mathematicians were interested 
in reapplying their discoveries to physics. The leading instt- 
tution inthis respect was the University of Gottingen, where 
Hilbert had unsuccessfully attempted to produce a general 
theory of relativity before Einstein, and it was there that 
many of the leaders of the coming revolution in quantum 
mechanics were to study. There too went many of the lead- 
ing mathematicians of their generation, notably John von 
Neumann and Hermann Weyl, to study with Hilbert. In 
1904 Hilbert had turned to the study of integral equations. 
These arise in many problems where the unknown is itself 
a function of some variable, and especially in those parts 
of physics that are expressed in terms of extremal princi- 
ples (such as the principle of least action). The extremal 
principle usually yields information about an integral 
involving the sought-for function, hence the name inte- 
gral equation. Hilbert’s contribution was to bring together 
many different strands of contemporary work and to 
show how they could be elucidated if cast in the form of 
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arguments about objects in certain infinite-dimensional 
vector spaces. 

The extension to infinite dimensions was not a trivial 
task, but it brought with it the opportunity to use geomet- 
ric intuition and geometric concepts to analyze problems 
about integral equations. Hilbert left it to his students to 
provide the best abstract setting for his work, and thus 
was born the concept of a Hilbert space. Roughly, this 
is an infinite-dimensional vector space in which it makes 
sense to speak of the lengths of vectors and the angles 
between them; useful examples include certain spaces 
of sequences and certain spaces of functions. Operators 
defined on these spaces are also of great interest; their 
study forms part of the field of functional analysis. 

When in the 1920s mathematicians and _ physi- 
cists were seeking ways to formulate the new quantum 
mechanics, von Neumann proposed that the subject be 
written in the language of functional analysis. The quan- 
tum mechanical world of states and observables, with 
its mysterious wave packets that were sometimes like 
particles and sometimes like waves depending on how 
they were observed, went very neatly into the theory of 
Hilbert spaces. Functional analysis has ever since grown 
with the fortunes of particle physics. 


ALGEBRAIC TOPOLOGY 


The early 20th century saw the emergence of a number 
of theories whose power and utility reside in large part 
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in their generality. Typically, they are marked by an 
attention to the set or space of all examples of a partic- 
ular kind. (Functional analysis is such an endeavour.) 
One of the most energetic of these general theories was 
that of algebraic topology. In this subject a variety of 
ways are developed for replacing a space by a group and 
a map between spaces by a map between groups. It is 
like using X-rays: information is lost, but the shadowy 
image of the original space may turn out to contain, 
in an accessible form, enough information to solve the 
question at hand. 

Interest in this kind of research came from var- 
ious directions. Galois’s theory of equations was an 
example of what could be achieved by transforming a 
problem in one branch of mathematics into a problem 
in another, more abstract branch. Another impetus 
came from Riemann’s theory of complex functions. He 
had studied algebraic functions—that is, loci defined 
by equations of the form f(x, y) = 0, where / is a poly- 
nomial in x whose coefficients are polynomials in y. 
When x and y are complex variables, the locus can be 
thought of as a real surface spread out over the x plane 
of complex numbers (today called a Riemann surface). 
To each value of x there correspond a finite number of 
values of y. Such surfaces are not easy to comprehend, 
and Riemann had proposed to draw curves along them 
in such a way that, if the surface was cut open along 
them, it could be opened out into a polygonal disk. He 
was able to establish a profound connection between the 
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Left: Pieces of a surface given by f(x, y) = O. Right: if the 


surface is cut along the curves, an octagon is obtained. 


minimum number of curves needed to do this for a given 
surface and the number of functions (becoming infinite 
at specified points) that the surface could then support. 
The natural problem was to see how far Riemann’s 
ideas could be applied to the study of spaces of higher 
dimension. Here two lines of inquiry developed. One 
emphasized what could be obtained from looking at the 
projective geometry involved. This point of view was fruit- 
fully applied by the Italian school of algebraic geometers. 
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Left: f(x, y) = x (x +1) y = O intersects itself at (x, y) 
(O, O). Right: E(x, y,z) = O=y (y +z) - x intersects 


itself along the z axis. The origin is a triple self-intersection. 


It ran into problems, which it was not wholly able to solve, 
having to do with the singularities a surface can possess. 
Whereas a locus given by f(x, y) = 0 may intersect itself 
only at isolated points, a locus given by an equation of 
the form f(x, y, z) = 0 may intersect itself along curves, a 
problem that caused considerable difficulties. The second 
approach emphasized what can be learned from the study 
of integrals along paths on the surface. This approach, 
pursued by Charles-Emile Picard and by Poincaré, pro- 
vided a rich generalization of Riemann’s original ideas. 
On this base, conjectures were made and a general 
theory produced, first by Poincaré and then by the Ameri- 
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can engineer-turned-mathematician Solomon Lefschetz, 
concerning the nature of manifolds of arbitrary dimen- 
sion. Roughly speaking, a manifold is the n-dimensional 
generalization of the idea of a surface; it is a space any 
small piece of which looks like a piece of n-dimensional 
space. Such an object is often given by a single algebraic 
equation in 7 + 1 variables. At first the work of Poin- 
caré and of Lefschetz was concerned with how these 
manifolds may be decomposed into pieces, counting the 
number of pieces and decomposing them in their turn. 
The result was a list of numbers, called Betti numbers 
in honour of the Italian mathematician Enrico Betti, 
who had taken the first steps of this kind to extend 
Riemann’s work. It was only in the late 1920s that the 
German mathematician Emmy Noether suggested how 
the Betti numbers might be thought of as measuring 
the size of certain groups. At her instigation a number 
of people then produced a theory of these groups, the 
so-called homology and cohomology groups of a space. 

Two objects that can be deformed into one another 
will have the same homology and cohomology groups. 
To assess how much information is lost when a space 
is replaced by its algebraic topological picture, Poin- 
caré asked the crucial converse question “According 
to what algebraic conditions is it possible to say that 
a space is topologically equivalent to a sphere?” He 
showed by an ingenious example that having the same 
homology is not enough and proposed a more delicate 
index, which has since grown into the branch of topol- 
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ogy called homotopy theory. Being more delicate, it is 
both more basic and more difficult. There are usually 
standard methods for computing homology and coho- 
mology groups, and they are completely known for 
many spaces. In contrast, there is scarcely an interest- 
ing class of spaces for which all the homotopy groups 
are known. Poincaré’s conjecture that a space with the 
homotopy of a sphere actually is a sphere was shown 
to be true in the 1960s in dimensions five and above, 
and in the 1980s it was shown to be true for four-di- 
mensional spaces. In 2006 Grigori Perelman was 
awarded a Fields Medal for proving Poincaré’s con- 
jecture true in three dimensions, the only dimension in 
which Poincaré had studied it. 


DEVELOPMENTS IN Pure MATHEMATICS 


The interest in axiomatic systems at the turn of the 
century led to axiom systems for the known algebraic 
structures, that for the theory of fields, for example, 
being developed by the German mathematician Ernst 
Steinitz in 1910. The theory of rings (structures in 
which it is possible to add, subtract, and multiply but 
not necessarily divide) was much harder to formalize. 
It is important for two reasons: the theory of algebraic 
integers forms part of it, because algebraic integers nat- 
urally form into rings; and (as Kronecker and Hilbert 
had argued) algebraic geometry forms another part. 
The rings that arise there are rings of functions defin- 
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able on the curve, surface, or manifold or are definable 
on specific pieces of it. 

Problems in number theory and algebraic geom- 
etry are often very difficult, and it was the hope of 
mathematicians such as Noether, who laboured to 
produce a formal, axiomatic theory of rings, that, by 
working at a more rarefied level, the essence of the 
concrete problems would remain while the distracting 
special features of any given case would fall away. This 
would make the formal theory both more general and 
easier, and to a surprising extent these mathematicians 
were successful. 

A further twist to the development came with the 
work of the American mathematician Oscar Zariski, 
who had studied with the Italian school of algebraic 
geometers but came to feel that their method of work- 
ing was imprecise. He worked out a detailed program 
whereby every kind of geometric configuration could 
be redescribed in algebraic terms. His work succeeded 
in producing a rigorous theory, although some, notably 
Lefschetz, felt that the geometry had been lost sight of 
in the process. 

The study of algebraic geometry was amenable to 
the topological methods of Poincaré and Lefschetz so 
long as the manifolds were defined by equations whose 
coefficients were complex numbers. But, with the cre- 
ation of an abstract theory of fields, it was natural to 
want a theory of varieties defined by equations with 
coefficients in an arbitrary field. This was provided 
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for the first time by the French mathematician André 
Weil, in his Foundations of Algebraic Geometry (1946), 
in a way that drew on Zariski’s work without suppress- 
ing the intuitive appeal of geometric concepts. Weil’s 
theory of polynomial equations is the proper setting 
for any investigation that seeks to determine what 
properties of a geometric object can be derived solely 
by algebraic means. But it falls tantalizingly short of 
one topic of importance: the solution of polynomial 
equations in integers. This was the topic that Weil took 
up next. 

The central difficulty is that in a field it is possible 
to divide but in a ring it is not. The integers form a ring 
but not a field (dividing 1 by 2 does not yield an inte- 
ger). But Weil showed that simplified versions (posed 
over a field) of any question about integer solutions to 
polynomials could be profitably asked. This transferred 
the questions to the domain of algebraic geometry. To 
count the number of solutions, Weil proposed that, 
since the questions were now geometric, they should 
be amenable to the techniques of algebraic topol- 
ogy. This was an audacious move, since there was no 
suitable theory of algebraic topology available, but 
Weil conjectured what results it should yield. The 
difficulty of Weil’s conjectures may be judged by the 
fact that the last of them was a generalization to this 
setting of the famous Riemann hypothesis about 
the zeta function, and they rapidly became the focus of 
international attention. 
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Weil, along with Claude Chevalley, Henri Car- 
tan, Jean Dieudonné, and others, created a group of 
young French mathematicians who began to publish 
virtually an encyclopaedia of mathematics under 
the name Nicolas Bourbaki, taken by Weil from an 
obscure general of the Franco-German War. Bour- 
baki became a self-selecting group of young mathe- 
maticians who were strong on algebra, and the indi- 
vidual Bourbaki members were interested in the Weil 
conjectures. In the end they succeeded completely. A 
new kind of algebraic topology was developed, and 
the Weil conjectures were proved. The generalized 
Riemann hypothesis was the last to surrender, being 
established by the Belgian Pierre Deligne in the early 
1970s. Strangely, its resolution still leaves the original 
Riemann hypothesis unsolved. 

Bourbaki was a key figure in the rethinking of 
structural mathematics. Algebraic topology was axi- 
omatized by Samuel Eilenberg, a Polish-born Amer- 
ican mathematician and Bourbaki member, and the 
American mathematician Norman Steenrod. Saun- 
ders Mac Lane, also of the United States, and Eilen- 
berg extended this axiomatic approach until many 
types of mathematical structures were presented in 
families, called categories. Hence there was a category 
consisting of all groups and all maps between them 
that preserve multiplication, and there was another 
category of all topological spaces and all continuous 
maps between them. To do algebraic topology was 
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to transfer a problem posed in one category (that 
of topological spaces) to another (usually that of 
commutative groups or rings). When he created the 
right algebraic topology for the Weil conjectures, the 
German-born French mathematician Alexandre Gro- 
thendieck, a Bourbaki of enormous energy, produced 
anew description of algebraic geometry. In his hands it 
became infused with the language of category theory. 
The route to algebraic geometry became the steepest 
ever, but the views from the summit have a naturalness 
and a profundity that have brought many experts to 
prefer it to the earlier formulations, including Weil’s. 
Grothendieck’s formulation makes algebraic geom- 
etry the study of equations defined over rings rather 
than fields. Accordingly, it raises the possibility that 
questions about the integers can be answered directly. 
Building on the work of like-minded mathematicians in 
the United States, France, and Russia, the German Gerd 
Faltings triumphantly vindicated this approach when 
he solved the Englishman Louis Mordell’s conjecture in 
1983. This conjecture states that almost all polynomial 
equations that define curves have at most finitely many 
rational solutions; the cases excluded from the conjecture 
are the simple ones that are much better understood. 
Meanwhile, Gerhard Frey of Germany had pointed 
out that, if Fermat’s last theorem 1s false, so that there are 
integers u, v, w such that v’ + v = w’ (p greater than 5), then 
for these values of u, v, and p the curve y = x(x —u' \(x + v’) 
has properties that contradict major conjectures of the 
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Japanese mathematicians Tantyama Yutaka and Shimura 
Goro about elliptic curves. Frey’s observation, refined 
by Jean-Pierre Serre of France and proved by the Ameri- 
can Ken Ribet, meant that by 1990 Taniyama’s unproven 
conjectures were known to imply Fermat’s last theorem. 

In 1993 the English mathematician Andrew 
Wiles established the Shimura-Taniyama conjectures 
in a large range of cases that included Frey’s curve and 
therefore Fermat’s last theorem—a major feat even with- 
out the connection to Fermat. It soon became clear that 
the argument had a serious flaw; but in May 1995 Wiles, 
assisted by another English mathematician, Richard 
Taylor, published a different and valid approach. In so 
doing, Wiles not only solved the most famous outstand- 
ing conjecture in mathematics but also triumphantly 
vindicated the sophisticated and difficult methods of 
modern number theory. 


MATHEMATICAL PHysiIcs AND THE THEORY OF GROUPS 


In the 1910s the ideas of Lie and Killing were taken up 
by the French mathematician Elie-Joseph Cartan, who 
simplified their theory and rederived the classification of 
what came to be called the classical complex Lie alge- 
bras. The simple Lie algebras, out of which all the others 
in the classification are made, were all representable as 
algebras of matrices, and, in a sense, Lie algebra is the 
abstract setting for matrix algebra. Connected to each 
Lie algebra there were a small number of Lie groups, 
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and there was a canonical simplest one to choose in 
each case. The groups had an even simpler geometric 
interpretation than the corresponding algebras, for they 
turned out to describe motions that leave certain prop- 
erties of figures unaltered. For example, in Euclidean 
three-dimensional space, rotations leave unaltered the 
distances between points; the set of all rotations about 
a fixed point turns out to form a Lie group, and it is one 
of the Lie groups in the classification. The theory of Lie 
algebras and Lie groups shows that there are only a few 
sensible ways to measure properties of figures in a linear 
space and that these methods yield groups of motions 
leaving the figures, which are (more or less) groups of 
matrices, unaltered. The result is a powerful theory that 
could be expected to apply to a wide range of problems 
in geometry and physics. 

The leader in the endeavours to make Cartan’s the- 
ory, which was confined to Lie algebras, yield results 
for a corresponding class of Lie groups was the Ger- 
man American Hermann Weyl. He produced a rich 
and satisfying theory for the pure mathematician and 
wrote extensively on differential geometry and group 
theory and its applications to physics. Weyl attempted 
to produce a theory that would unify gravitation and 
electromagnetism. His theory met with criticism from 
Einstein and was generally regarded as unsuccessful; 
only in the last quarter of the 20th century did sim- 
ilar unified field theories meet with any acceptance. 
Nonetheless, Weyl’s approach demonstrates how 
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the theory of Lie groups can enter into physics in a 
substantial way. 

In any physical theory the endeavour is to make 
sense of observations. Different observers make differ- 
ent observations. If they differ in choice and direction 
of their coordinate axes, they give different coordinates 
to the same points, and so on. Yet the observers agree 
on certain consequences of their observations: in New- 
tonian physics and Euclidean geometry they agree on 
the distance between points. Special relativity explains 
how observers in a state of uniform relative motion 
differ about lengths and times but agree on a quantity 
called the interval. In each case they are able to do so 
because the relevant theory presents them with a group 
of transformations that converts one observer’s mea- 
surements into another’s and leaves the appropriate 
basic quantities invariant. What Weyl proposed was 
a group that would permit observers in nonuniform 
relative motion, and whose measurements of the same 
moving electron would differ, to convert their measure- 
ments and thus permit the (general) relativistic study of 
moving electric charges. 

In the 1950s the American physicists Chen Ning 
Yang and Robert L. Mills gave a successful treatment of 
the so-called strong interaction in particle physics from 
the Lie group point of view. Twenty years later mathe- 
maticians took up their work, and a dramatic resurgence 
of interest in Weyl’s theory began. These new develop- 
ments, which had the incidental effect of enabling math- 
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ematicians to escape the problems in Weyl’s original 
approach, were the outcome of lines of research that 
had originally been conducted with little regard for phys- 
ical questions. Not for the first time, mathematics was to 
prove surprisingly effective—or, as the Hungarian-born 
American physicist Eugene Wigner said, “unreasonably 
effective”—in science. 

Cartan had investigated how much may be accom- 
plished in differential geometry by using the idea of 
moving frames of reference. This work, which was 
partly inspired by Einstein’s theory of general relativ- 
ity, was also a development of the ideas of Rieman- 
nian geometry that had originally so excited Einstein. 
In the modern theory one imagines a space (usually 


cylinder Mébius band 


Mobius band 
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a manifold) made up of overlapping coordinatized 
pieces. On each piece one supposes some functions 
to be defined, which might in applications be the val- 
ues of certain physical quantities. Rules are given for 
interpreting these quantities where the pieces overlap. 
The data are thought of as a bundle of information 
provided at each point. For each function defined on 
each patch, it is supposed that at each point a vector 
space is available as mathematical storage space for all 
its possible values. Because a vector space is attached 
at each point, the theory is called the theory of vec- 
tor bundles. Other kinds of space may be attached, 
thus entering the more general theory of fibre bun- 
dles. The subtle and vital point is that it is possible to 
create quite different bundles which nonetheless look 
similar in small patches. The cylinder and the Mobius 
band look alike in small pieces but are topologically 
distinct, since it is possible to give a standard sense 
of direction to all the lines in the cylinder but not to 
those in the Mobius band. Both spaces can be thought 
of as one-dimensional vector bundles over the circle, 
but they are very different. The cylinder is regarded as 
a “trivial” bundle, the Mobius band as a twisted one. 
In the 1940s and 50s a vigorous branch of alge- 
braic topology established the main features of the 
theory of bundles. Then, in the 1960s, work chiefly by 
Grothendieck and the English mathematician Michael 
Atiyah showed how the study of vector bundles on 
spaces could be regarded as the study of cohomology 
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theory (called K theory). More significantly still, in the 
1960s Atiyah, the American Isadore Singer, and others 
found ways of connecting this work to the study of a 
wide variety of questions involving partial differentia- 
tion, culminating in the celebrated Atiyah-Singer the- 
orem for elliptic operators. (Elliptic is a technical term 
for the type of operator studied in potential theory.) 
There are remarkable implications for the study of pure 
geometry, and much attention has been directed to the 
problem of how the theory of bundles embraces the the- 
ory of Yang and Mills, which it does precisely because 
there are nontrivial bundles, and to the question of how 
it can be made to pay off in large areas of theoretical 
physics. These include the theories of superspace and 
supergravity and the string theory of fundamental par- 
ticles, which involves the theory of Riemann surfaces in 
novel and unexpected ways. 


PROBABILISTIC MATHEMATICS 


The most notable change in the field of mathematics in 
the late 20th and early 21st centuries has been the grow- 
ing recognition and acceptance of probabilistic methods 
in many branches of the subject, going well beyond their 
traditional uses in mathematical physics. At the same 
time, these methods have acquired new levels of rigour. 
The turning point is sometimes said to have been the 
award of a Fields Medal in 2006 to French mathemati- 
cian Wendelin Werner, the first time the medal went to a 
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probabilist, but the topic had acquired a central position 
well before then. 

As noted above, probability theory was made into 
a rigorous branch of mathematics by Kolmogorov in 
the early 1930s. An early use of the new methods was 
a rigorous proof of the ergodic theorem by American 
mathematician George David Birkhoff in 1931. The air 
in a room can be used in an example of the theorem. 
When the system is in equilibrium, it can be defined 
by its temperature, which can be measured at regular 
intervals. The average of all these measurements over a 
period of time is called the time average of the tempera- 
ture. On the other hand, the temperature can be mea- 
sured at many places in the room at the same time, and 
those measurements can be averaged to obtain what is 
called the space average of the temperature. The ergodic 
theorem says that under certain circumstances and as 
the number of measurements increases indefinitely, the 
time average equals the space average. The theorem was 
immediately applied by American mathematician Joseph 
Leo Doob to give the first proof of Fisher’s law of maxi- 
mum likelihood, which British statistician Ronald Fisher 
had put forward as a reliable way to estimate the right 
parameters in fitting a given probability distribution to a 
set of data. Thereafter, rigorous probability theory was 
developed by several mathematicians, including Doob in 
the United States, Paul Lévy in France, and a group who 
worked with Aleksandr Khinchin and Kolmogorov in 
the Soviet Union. 
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Doob’s work was extended by the Japanese math- 
ematician Ito Kiyoshi, who did important work for 
many years on stochastic processes (that is, systems 
that evolve under a probabilistic rule). He obtained a 
calculus for these processes that generalizes the famil- 
iar rules of classical calculus to situations where it no 
longer applies. The Ito calculus found its most 
celebrated application in modern finance, where it 
underpins the Black-Scholes equation that is used in 
derivative trading. 

However, it remained the case, as Doob often 
observed, that analysts and probabilists tended to keep 
their distance from each other and did not sufficiently 
appreciate the merits of thinking rigorously about prob- 
abilistic problems (which were often left to physicists) 
or of thinking probabilistically in purely analytical 
problems. This was despite the growing success of prob- 
abilistic methods in analytical number theory, a devel- 
opment energetically promoted by Hungarian math- 
ematician Paul Erdos in a seemingly endless stream of 
problems of varying levels of difficulty (many of which 
he offered money for their solution). 

A major breakthrough in this subject occurred in 
1981, although it goes back to the work of Poincaré in 
the 1880s. His celebrated recurrence theorem in celes- 
tial mechanics had made it very plausible that a par- 
ticle moving in a bounded region of space will return 
infinitely often and arbitrarily close to any position it 
ever occupies. In the 1920s Birkhoff and others gave 
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this theorem a rigorous formulation in the language of 
dynamical systems and measure theory, the same setting 
as the ergodic theorem. The result was quickly stripped 
of its trappings in the theory of differential equations 
and applied to a general setting of a transformation of 
a space to itself. If the space is compact (for example, a 
closed and bounded subset of Euclidean space such as 
Poincaré had considered, but the concept is much more 
general) and the transformation is continuous, then the 
recurrence theorem holds. In particular, in 1981 Israeli 
mathematician Hillel Furstenberg showed how to use 
these ideas to obtain results in number theory, specifi- 
cally new proofs of theorems by Dutch mathematician 
Bartel van der Waerden and Hungarian American math- 
ematician Endre Szemeréd1. 

Van der Waerden’s theorem states that if the pos- 
itive integers are divided into any finite number of dis- 
joint sets (1.e., sets without any members in common) 
and k is an arbitrary positive integer, then at least 
one of the sets contains an arithmetic progression 
of length k. Szemerédi’s theorem extends this claim 
to any subset of the positive integers that is suitably 
large. These results led to a wave of interest that influ- 
enced a most spectacular result: the proof by British 
mathematician Ben Green and Australian mathemati- 
cian Terence Tao in 2004 that the set of prime numbers 
(which is not large enough for Szemerédi’s theorem to 
apply) also contains arbitrarily long arithmetic pro- 
gressions. This is one of a number of results in diverse 
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areas of mathematics that led to Tao’s being awarded 
a Fields Medal in 2006. 

Since then, Israeli mathematician Elon Linden- 
strauss, Austrian mathematician Manfred Euinsiedler, 
and Russian American mathematician Anatole Katok 
have been able to apply a powerful generalization of the 
methods of ergodic theory pioneered by Russian math- 
ematician Grigory Margulis to show that Littlewood’s 
conjecture in number theory is true for all but a very 
small set of integers. This conjecture is the claim about 
how well any two irrational numbers, x and y, can be 
simultaneously approximated by rational numbers of 
the form p/n and q/n. For this and other applications 
of ergodic theory to number theory, Lindenstrauss was 
awarded a Fields Medal in 2010. 

A major source of problems about probabilities 
is statistical mechanics, which grew out of thermody- 
namics and concerns with the motion of gases and other 
systems with too many dimensions to be treated any 
other way than probabilistically. For example, at room 
temperature there are around 10°’ molecules of a gas in 
a room. 

Typically, a physical process is modeled on a lat- 
tice, which consists of large arrangements of points that 
have links to their immediate neighbours. For technical 
reasons, much work is confined to lattices in the plane. 
A physical process is modeled by ascribing a state (e.g., 
+1 or —1, spin up or spin down) and giving a rule that 
determines at each instant how each point changes its 
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state according to the state of its neighbours. For exam- 
ple, if the lattice is modeling the gas in a room, the room 
should be divided into cells so small that there is either 
no molecule in the cell or exactly one. Mathematicians 
investigate what distributions and what rules produce an 
irreversible change of state. 

A typical such question is percolation theory, which 
has applications in the study of petroleum deposits. A 
typical problem starts with a lattice of points in the plane 
with integer coordinates, some of which are marked with 
black dots (“oil”). If these black dots are made at ran- 
dom, or if they spread according to some law, how likely 
is it that the resulting distribution will form one con- 
nected cluster, in which any black dot is connected to any 
other through a chain of neighbouring black dots? The 
answer depends on the ratio of the number of black dots 
to the total number of dots, and the probability increases 
markedly as this ratio goes above a certain critical size. 
A central problem here, that of the crossing probability, 
concerns a bounded region of the plane inside which 
a lattice of points is marked out as described, and the 
boundary is divided into regions. The question is: What 
is the probability that a chain of black dots connects two 
given regions of the boundary? 

If the view taken is that the problem is fundamen- 
tally finite and discrete, it is desirable that a wide range 
of discrete models or lattices lead to the same conclu- 
sions. This has led to the idea of a random lattice and 
a random graph, meaning the most typical one. One 
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starts by considering all possible initial configurations, 
such as all possible different distributions of black 
and white dots in a given plane lattice, or all possible 
different ways a given collection of computers could 
be linked together. Depending on the rule chosen for 
colouring a dot (say, the toss of a fair coin) or the rule 
for linking two computers, one obtains an idea of what 
sorts of lattices or graphs are most likely to arise (in the 
lattice example, those with about the same number of 
black and white dots), and these most likely lattices are 
called random graphs. The study of random graphs has 
applications in physics, computer science, and many 
other fields. 

The network of computers is an example of a graph. 
A good question is: How many computers should each 
computer be connected to before the network forms into 
very large connected chunks? It turns out that for graphs 
with a large number of vertices (say, a million or more) 
in which vertices are joined in pairs with probability p, 
there is a critical value for the number of connections 
on average at each vertex. Below this number the graph 
will almost certainly consist of many small islands, and 
above this number it will almost certainly contain one 
very large connected component, but not two or more. 
This component is called the giant component of the 
Erd6os-Rényi model (after Erdos and Hungarian mathe- 
matician Alfréd Rényi). 

A major topic in statistical physics concerns the 
way substances change their state (e.g., from liquid 
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to gas when they boil). In these phase transitions, as 
they are called, there is a critical temperature, such as 
the boiling point, and the useful parameter to study is 
the difference between this temperature and the tem- 
perature of the liquid or gas. It had turned out that 
boiling was described by a simple function that raises 
this temperature difference to a power called the crit- 
ical exponent, which is the same for a wide variety of 
physical processes. The value of the critical exponent 
is therefore not determined by the microscopic aspects 
of the particular process but is something more gen- 
eral, and physicists came to speak of universality for 
the exponents. In 1982 American physicist Kenneth 
G. Wilson was awarded the Nobel Prize for Physics 
for illuminating this problem by analyzing the way 
systems near a change of state exhibit self-similar 
behaviour at different scales (1.e., fractal behaviour). 
Remarkable though his work was, it left a number of 
insights in need of a rigorous proof, and it provided 
no geometric picture of how the system behaved. 

The work for which Werner was awarded his Fields 
medal in 2006, carried out partly in collaboration 
with American mathematician Gregory Lawler and 
Israeli mathematician Oded Schramm, concerned the 
existence of critical exponents for various problems 
about the paths of a particle under Brownian motion, 
a typical setting for problems concerning crossing 
probabilities (that is, the probability for a particle to 
cross a specific boundary). Werner’s work has greatly 
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illuminated the nature of the crossing curves, and the 
boundary of the regions that form in the lattice that 
are bounded by curves as the number of lattice points 
grows. In particular, he was able to show that Polish 
American mathematician Benoit Mandelbrot’s con- 
jecture regarding the fractal dimension (a measure of 
a shape’s complexity) of the boundary of the largest 
of these sets was correct. 

Mathematicians who regard these probabilistic 
models as approximations to a continuous reality seek 
to formulate what happens in the limit as the approxi- 
mations improve indefinitely. This connects their work 
to an older domain of mathematics with many pow- 
erful theorems that can be applied once the limiting 
arguments have been secured. There are, however, very 
deep questions to be answered about this passage to the 
limit, and there are problems where it fails, or where 
the approximating process must be tightly controlled 
if convergence is to be established at all. In the 1980s 
the British physicist John Cardy, following the work 
of Russian physicist Aleksandr Polyakov and others, 
had established on strong informal grounds a number 
of results with good experimental confirmation that 
connected the symmetries of conformal field theories 
in physics to percolation questions in a hexagonal 
lattice as the mesh of the lattice shrinks to zero. In 
this setting a discrete model is a stepping stone on the 
way to a continuum model, and so, as noted, the cen- 
tral problem is to establish the existence of a limit as 
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the number of points in the discrete approximations 
increases indefinitely and to prove properties about it. 
Russian mathematician Stanislav Smirnov established 
in 2001 that the limiting process for triangular lattices 
converged and gave a way to derive Cardy’s formulae 
rigorously. He went on to produce an entirely novel 
connection between complex function theory and 
probability that enabled him to prove very general 
results about the convergence of discrete models to the 
continuum case. For this work, which has applications 
to such problems as how liquids can flow through soil, 
he was awarded a Fields Medal in 2010. 


Chapter 4 


INDIAN AND 


EAST ASIAN 
MATHEMATICS 


his chapter covers the history of mathematics 

in the Indian subcontinent from ancient times 
through the beginning of the colonization of 

the region by Great Britain. During and after the 19th 


century, Indian mathematics merged with the modern 
Western stream of mathematics. 


Sanskrit, the classical language of India and the chief 
medium for its premodern mathematical texts, main- 
tained a strictly oral literary tradition for many centu- 
ries. Even after writing was introduced, the traditional 
writing materials, such as palm leaves, birch bark, and 
(later) paper, did not last long in the South Asian 
climate. The earliest surviving Sanskrit references 
to mathematical subjects are some number words in 
the Vedas, ancient sacred texts that were passed down 
by recitation and memorization. (The oldest surviving 
Veda manuscript dates from the 16th century.) For 
example, an invocation in the Yajurveda (“Veda of 
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Sacrifice”) includes names for successive powers of 10 
up to about 10° —well beyond the thousands and ten 
thousands familiar to other ancient cultures. Although 
the Indian number system seems always to have been 
decimal, in the Satapatha Brahmana (c. 1000 BCE; 
“Vedic Exegesis of a Hundred Paths”) there is an 
interesting sequence of divisions of 720 bricks into 
groups of successively smaller quantities, with the 
explicit exclusion of all divisors that are multiples of 
numbers which are relatively prime to 60 (1.e., their 
only common divisor is 1). This is reminiscent of the 
structure of ancient Babylonian sexagesimal division 
tables and may indicate (as do some later astronomi- 
cal texts) the influence of the base-60 mathematics of 
Mesopotamia. 

The people who left these traces of their thinking 
about numbers were members of the Brahman class, 
priestly functionaries employed in the preparation and 
celebration of the various ritual sacrifices. The rich- 
est evidence of their mathematical activity is found in 
the several Ist-millennium-BCE Sulbasutras (“Cord- 
Rules”), collections of brief prose sentences prescribing 
techniques for constructing the brick fire altars where the 
sacrifices were to be carried out. Using simple tools of 
ropes and stakes, the altar builders could produce quite 
sophisticated geometric constructions, such as trans- 
forming one plane figure into a different one of equal 
area. The recorded rules also indicate knowledge of geo- 
metric fundamentals such as the Pythagorean theorem, 
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values for the ratio of the circumference of a circle to its 
diameter (i.e., 1), and values for the ratio of the diagonal 
of a square to its side (\2). Different shapes and sizes 
of sacrificial altars were described as conferring differ- 
ent benefits—such as wealth, sons, and attainment of 
heaven—upon the sponsor of the sacrifice. Perhaps these 
ritual associations originally inspired the development of 
this geometric knowledge, or perhaps it was the other way 
around: the beauty and harmony of the geometric discov- 
eries were sacralized by integrating them into ritual. 


THE Post-Vepic CONTEXT 


During the rise of Buddhism and Jainism after 500 BCE, 
the connection between mathematical and religious 
thought persisted. But instead of altar constructions 
for animal sacrifices, which Buddhist and Jain princi- 
ples rejected, mathematics supplied a framework for 
cosmological and philosophical schemes. Jain authors 
in particular employed immense numbers (even infinity) 
in elaborate and vast models of the universe. These new 
religions, as well as the older Vedic religion—by this 
time mostly shorn of ritual animal slaughter and more 
akin to modern Hinduism—also required mathematical 
techniques for astronomical models in order to maintain 
their calendars. Some of these techniques, such as the 
use of sexagesimal units and employing linear “zigzag” 
functions to represent seasonal changes in the duration 
of daylight, seem to have been inspired by Mesopota- 
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mian sources that reached northwest India via the Ach- 
aemenian dynasty. 

Other applications of mathematics, such as in com- 
merce and administration, must also have flourished at 
this time, although only occasional brief allusions sur- 
vive. For instance, a Buddhist text (c. Ist century BCE) 
by Vasumitra mentions merchants’ “counting pits,” 
where tokens in a row of shallow depressions kept track 
of units, hundreds, and thousands (a tens pit may have 
been included but is not specified). Using these as a sim- 
ile for the changeable aspects of unchanging realities, 
Vasumitra says, “When [the same] clay counting-piece is 
in the place of units, it is denoted as one, when in hun- 
dreds, one hundred.” 


INDIAN NUMERALS AND 
THE DECIMAL PLACE-VALUE SYSTEM 


These centuries around the turn of the millennium 
also left some physical evidence concerning the forms 
of written numerals. The above-mentioned allusion to 
interchangeable tokens in counting pits suggests a form 
of decimal place value. However, inscriptions on monu- 
ments and deed plates reveal that early Indian numeral 
systems (e.g., the Brahmi numerals) were not place-val- 
ued; rather, they used different symbols for the same 
multiple of different powers of 10. Because epigraphical 
styles tend to be conservative and the number of known 
examples is not large, it is hard to tell exactly when and 
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how the transition was made to a purely place-value 
system—indeed, different systems must have coexisted 
for many years. But decimal place value must have been 
in use (at least among mathematical professionals) no 
later than the early 1st millennium ce. This is illustrated, 
for example, in a 3rd-century-ce Sanskrit adaptation 
of a Greek astrological text that uses the Indian “con- 
crete number” system, where names of things stand in 
for numbers associated with them—e.g., “moon” for 1, 
“eye” for 2, “Veda” for 4, “tooth” for 32, and so on. In 
this way, the compound “moon-Veda-eye-moon” would 
be read as 1,241, implying that the reader automatically 
assumed a strictly decimal place-value representation. 
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The founding of the Gupta dynasty in 320 ce is 
sometimes used as a convenient marker for the start of 
“classical” Indian civilization. For a while, considerable 
political consolidation and expansion took place within 
the subcontinent and beyond its shores to Southeast 
Asia, while direct contact with the West lessened after 
the heyday of trade with Rome. An increasing number 
of complete treatises on mathematical subjects survived 
from this period, beginning about the middle of the Ist 
millennium, in contrast to the scattered allusions and 
fragments of the ancient period. 


THE RoLe of ASTRONOMY AND ASTROLOGY 


Greek mathematical models in astronomy and astrology 
appeared in India following the invasion of Alexander 
the Great. These models were integrated with existing 
Indian material to produce an extremely fruitful sys- 
tem of Sanskrit mathematical astronomy and astrol- 
ogy, known as jyotisa. The intellectual place of ganita, 
according to the canons of Sanskrit literature, was 
located within jyotisa, which in turn was identified as 
one of the six Vedangas (“limbs of the Veda’), whose 
purpose was to support the proper performance of Vedic 
rituals. As a result, much of our knowledge of classical 
Indian mathematics is supplied by astronomical texts. Of 
course, there were many nonastronomical applications 
of ganita as well. Buddhists, Jains, and Hindus all valued 
mathematical astronomy for practical uses such as time- 
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keeping, calendrics, and astrology and also ascribed to it 
intellectual and spiritual importance. 

Among the earliest of these works that have been pre- 
served are the foundational treatises of two major astro- 
nomical schools: the Aryabhatiya of Aryabhata (c. 500 
CE) and the Brahma-sphuta-siddhanta (628; “Correctly 
Established Doctrine of Brahma”) of Brahmagupta. 
Little is known of these authors. Aryabhata lived in 
Kusumapura (near modern Patna), and Brahmagupta is 
said to have been from Bhillamala (modern Bhinmal), 
which was the capital of the Gurjara-Pratihara dynasty. 
The “schools” that grew from their works were not 
physical institutions but rather textual lineages, built up 
over the subsequent centuries by the successive works of 
other scholars. Although members of different schools 
frequently criticized the astronomical parameters and 
techniques preferred by their rivals, their fundamental 
mathematical knowledge was largely the same. 

The oldest surviving detailed survey of that knowl- 
edge is the first section of the Aryabhatiya, titled Ganita. 
Its verses are devoted to a mélange of mathematical top- 
ics ranging from extraction of square and cube roots to 
plane and solid geometry, simple proportions, construc- 
tion of a sine table, summation of series, solution of qua- 
dratic equations, and solution of indeterminate equations 
of the first degree (equations of the type ax — by = c). 

Brahmagupta collected his mathematical basics 
into two chapters of his treatise. Chapter 12, also called 
“Ganita,” discusses rules for the fundamental operations 
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on integers and fractions as well as for series, propor- 
tions, and geometry. Chapter 18 deals with indetermi- 
nate equations of the first and second degrees and with 
algebra techniques for linear and quadratic equations 
(including rules for sign manipulation and the arithme- 
tic of zero). Trigonometric rules and tables are stated in 
astronomical chapters that employ them, and another 
chapter deals briefly with calculations relating to prosody. 
Both the Aryabhatiya and, apparently, an early text 
of the Brahma-sphuta-siddhanta school entered the Mus- 
lim world and were translated into Arabic near the end of 
the 8th century, profoundly influencing the development 
of Islamic mathematical astronomy. The Indian decimal 
place-value numerals had been introduced into west- 
ern Asia earlier, and the arithmetic operations involv- 
ing them became widespread under the name “Indian 
computation.” The techniques called by Arabic speak- 
ers al-jabr (“algebra”) also may have been influenced by 
early Indian methods, although they do not reflect the 
Indian mathematicians’ routine acceptance of negative 
numbers or their later highly developed notation. 


CLASSICAL MATHEMATICAL LITERATURE 


Almost all known Sanskrit mathematical texts consist 
mostly of concise formulas 1n verse. This was the standard 
format for many types of Sanskrit technical treatises, and 
the task of making sense out of its compressed formulas 
was aided in all its genres by prose commentaries. Verse 
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rules about mathematics, like those in any other subject, 
were designed to be learned by heart, but that does not 
necessarily mean that nothing was expected of the stu- 
dent beyond rote memorization. Frequently the rules were 
ambiguously expressed, apparently deliberately, so that 
only someone who understood the underlying mathemat- 
ics would be able to apply them properly. Commentaries 
helped by providing at least a word-by-word gloss of the 
meaning and usually some illustrative examples—and in 
some cases even detailed demonstrations. 

Verse works on mathematics and astronomy faced 
the special challenge of verbally representing numbers 
(which frequently occurred in tables, constants, and exam- 
ples) in strict metrical formats. “Concrete numbers” seem 
to have been devised for just that purpose. Another useful 
technique, developed somewhat later (about 500 CE), was 
the so-called katapayadi system in which each of the 10 
decimal digits was assigned to a set of consonants (begin- 
ning with the letters k, t, p, and y), while vowels had no 
numerical significance. This meant that numbers could 
be represented not only by normal-sounding syllables but 
by actual Sanskrit words using appropriate consonants 
in the appropriate sequence. In fact, some astronomers 
constructed entire numerical tables in the form of katapa- 
yadi sentences or poems. 

The original physical appearance of these math- 
ematical writings is more mysterious than their verbal 
content, because the treatises survive only in copies 
dating from much later times and reflecting later scribal 
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conventions. There is a striking exception, however, in 
the Bakhshali manuscript, found in 1881 by a farmer in 
his field in Bakhshali (near modern Peshawar, Pakistan). 
Written in a variant of Buddhist Hybrid Sanskrit on 
birch bark, most likely about the 7th century, this man- 
uscript is the only known Indian document on mathe- 
matics from this early period; it shows what the mathe- 
matical notation of that time and place actually looked 
like. The 10 decimal digits, including a dot for zero, were 
standard, and mathematical expressions were written 
without symbols, except for a square cross “+” written 
after negative numbers. This notation probably comes 
from the Indian letter for r, which stands for the Sanskrit 
word rhna (“negative”). Syllabic abbreviations—such 
as yu for yuta (“added”) and mu for mula (“root’”)— 
indicated operations on quantities. 

Because there are so few surviving physical represen- 
tatives of mathematical works dating from earlier than 
the mid-2nd millennium, it is difficult to say when, where, 
and how some of these notational conventions changed. 
In later texts the writing of equations was formalized so 
that both sides had the same number and kinds of terms 
(with zero coefficients where necessary). Each unknown 
was designated by a different syllabic abbreviation, typi- 
cally standing for the name of a colour, a word meaning 
“unknown,” or (in word problems) the name of the com- 
modity or other thing that the unknown represented. The 
practice of writing a square cross after a negative number 
was generally replaced by that of putting a dot over it. 
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THE CHANGING STRUCTURE OF 
MATHEMATICAL KNOWLEDGE 


Conventions of classification and organization of 
mathematical subjects seem to have evolved rapidly in 
the second half of the Ist millennium. Brahmagupta’s 
two chapters on mathematics already hint at the emerg- 
ing distinction between pati-ganita (arithmetic; literally 
“board-computations” for the dust board, or sand- 
box, on which calculations were carried out) and bija- 
ganita (algebra; literally “seed-computations” for the 
manipulation of equations involving an unknown 
quantity, or seed); these were also called “manifest” 
and “unmanifest” calculation, respectively, alluding 
to the types of quantities that they dealt with. Pati- 
ganita comprised (besides definitions of basic weights 
and measures) eight “fundamental” operations of arith- 
metic: addition, subtraction, multiplication, division, 
squaring, square-root extraction, cubing, and cube- 
root extraction; these were supplemented by techniques 
for reducing fractions and solving various types of 
proportions. The operations were applied to problems 
dealing with mixtures (unequal composition of various 
elements), series, plane and solid geometry, and the 
triangular geometry of shadows. Formulas for finding 
areas and volumes, reckoning interest, summing series, 
solving quadratic equations, and solving permutations 
and combinations (later expanded to include magic 
squares) were part of the standard pati-ganita tool kit. 
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Bija-ganita was sometimes called “sixfold” because 
it excluded problems involving the cube root or cube 
of an unknown (although procedures for cubing alge- 
braic expressions were known). It covered techniques 
for manipulating signs and coefficients of unknown 
quantities as well as surds (square roots of nonsquare 
integers), rules for setting up and solving equations up 
to second order in one or more unknowns, and rules for 
finding solutions to indeterminate equations of the first 
and second degree. 


MAHAVIRA AND BHASKARA II 


The pati-ganita and bija-ganita systems of arithmetic 
and algebra are more or less what is found in the com- 
paratively few Sanskrit treatises that deal exclusively 
with mathematics (all, apparently, composed after 
the middle of the Ist millennium). The content and 
organization of the topics varies somewhat from one 
work to another, each author having his own ideas of 
what concepts should be stressed. For instance, the 
9th-century Ganita-sara-sangraha (“Compendium of 
the Essence of Mathematics”) by Mahavira reflects 
the Jain cast of his erudition in details such as 
the inclusion of some of the infinitesimal units of 
Jain cosmology in his list of weights and measures. 
Mahavira entirely omitted addition and subtraction 
from his discussion of arithmetic, instead taking 
multiplication as the first of the eight fundamental 
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operations and filling the gap with summation and 
subtraction of series. On the other hand, the best- 
known of all works on Indian arithmetic and algebra, 
the 12th-century Lilavati (“The Beautiful”) and the 
more advanced Bijaganita, by Bhaskara II, followed 
the conventional definition of the eight operations. 
Bhaskara asserted, however, that the “Rule of Three” 
(of proportionality) is the truly fundamental concept 
underlying both arithmetic and algebra: 


Just as this universe is pervaded by Vishnu...with his 
many forms...in the same way, this whole type of com- 
putation is pervaded by the [rule of] three quantities. 


Bhaskara’s two works are interesting as well for 
their approaches to the arithmetic of zero. Both repeat 
the standard (though not universal) idea that a quantity 
divided by zero should be defined simply as “zero-di- 
vided” and that, if such a quantity is also multiplied by 
zero, the zeros cancel out to restore the original quantity. 
But the Bijaganita adds: 


In this quantity also which has zero as its divisor there 
is no change even when many [quantities ] have entered 
into it or come out [of it], just as at the time of destruc- 
tion and creation when throngs of creatures enter into 
and come out of [him, there is no change in] the infinite 
and unchanging [ Vishnu ]. 


This suggests that the quantitative result of dividing 
by zero was considered to be an infinite amount, possi- 
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bly reflecting greater sophistication of these concepts in 
the more advanced Bijaganita. 

Much additional mathematical material was dealt 
with in Sanskrit astronomical treatises—for example, 
trigonometry of chords, sines, and cosines and various 
kinds of numerical approximation, such as interpolation 
and iterative rules. 


TEACHERS AND LEARNERS 


Almost every known mathematical author also wrote works 
on jyotisa, or astronomy and astrology. This genre was so 
closely linked with that of ganita that it was not always clear 
to which of them a particular text belonged; for example, 
Bhaskara’s Lilavati and Bijaganita were often considered 
to be chapters of his astronomical magnum opus, Sid- 
dhanta-siromani (“Crest-Jewel of Astronomical Systems’’). 
These astronomical works were primarily aimed at students 
and scholars pursuing astronomy, astrology, and calendrics 
as their hereditary occupation (generally Hindu Brah- 
mans or scholar-monks of the heterodoxies). However, the 
need for more general instruction in ganita must certainly 
have affected a much broader segment of the population. 
Sample problems in mathematical texts (usually phrased in 
the second person as though addressed to a student) fre- 
quently discuss commercial transactions and often include 
vocatives such as “merchant” or “best of merchants,” sug- 
gesting that the intended audience included members of 
the mercantile class. 
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Furthermore, some problems contain feminine voc- 
atives such as “dear one” or “beautiful one,” particularly 
in the Lilavati of Bhaskara, which later legend holds 
to have been named after, and written for, the author’s 
daughter. There is a reference in a 15th-century text to 
certain mixture problems posed by mathematicians to 
ladies of the court, and many classical lists of the kalas, 
or civilized arts, include certain kinds of mathematical 
recreations, sometimes just mathematics in general, or 
even astronomy. Though the available details are very 
sparse, refined education for many upper-class peo- 
ple of both sexes was apparently expected to include 
some mathematics. 


THE SCHOOL OF MADHAVA IN KERALA 


Some of the most fascinating mathematical develop- 
ments in India in the 2nd millennium—indeed, in the 
history of mathematics as a whole—emerged from 
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the now-famous school of Madhava in Kerala on 
the Malabar Coast, a key region of the international 
spice trade. Madhava himself worked near the end of 
the 14th century, and verses attributed to him in the 
writings of his successors testify to his brilliant con- 
tributions on such topics as infinite series and the use 
of infinitesimal quantities. The work of these mathe- 
maticians anticipated several discoveries of the later 
European analysts, including power series for the sine, 
cosine, and arctangent which were also used to obtain 
m to 11 decimal places. Generations of Madhava’s fol- 
lowers—in particular Jyesthadeva, Nilakantha, and 
Sankara—supplied ingenious geometric demonstra- 
tions of these mathematical ideas. This remarkable 
school also provides one of the few known examples 
within Indian mathematics of a continuous chain of 
identified direct teacher-pupil contacts extending over 
the course of centuries, from Madhava in the late 
1300s through at least the early 1600s. 

Meanwhile, in the northern parts of India, inva- 
sion, war, and religious and caste exclusivity did not 
prevent a blending of Indian and Islamic mathematics 
in encounters between astronomers, particularly at the 
imperial Mughal courts. Islamic scientific works (mostly 
in Persian) were collaboratively translated into Sanskrit 
and vice versa. Concepts and results from Greco-Is- 
lamic spherical trigonometry, astronomical tables, and 
mathematical instruments thus found their way into 
Sanskrit jyotisa. 
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Similar practices at the start of Western colo- 
nization in the 16th century introduced such topics 
as logarithms and heliocentrism into a few Sanskrit 
texts. Even after the colonial policy of basing “native 
education” on an English curriculum was established 
in the 19th century, some scholars continued to recast 
foreign mathematics in the form of traditional San- 
skrit verse treatises. However, this work was overshad- 
owed by the rise of Indian mathematical research and 
mathematical societies on the lines of Western models. 
For the most part, by the end of the 19th century the 
river of Indian ganita had been fully merged into the 
ocean of modern mathematics. 


MATHEMATICS IN CHINA 


Books written in China from the Ist century BCE through 
the 7th century CE and then in the 13th century formed 
the foundation for the development of mathematics in 
East Asia. Most subsequent mathematical works refer 
to them. References found in the surviving mathemati- 
cal writings from this period, as well as references made 
in bibliographies compiled for dynastic annals, indicate 
that there are many gaps in the textual record. The old- 
est extant works probably survived because they became 
official books, taught in the context of the Chinese civil 
examination system. 

The most important work in the history of math- 
ematics in Chinese is Jiuzhang suanshu (The Nine 
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Chapters on the Mathematical Art), which contains 
arithmetic, algebraic, and geometric algorithms, pre- 
sented in relation to problems, some of which evoke 
the duties of the civil administration: surveying fields 
(areas), levying taxes according to various types of 
grains (ratios), determining wages for civil servants 
according to their position in the hierarchy (unequal 
sharing), measuring planned earthworks to determine 
labour needs and granaries to determine storage capac- 
ity (volumes), levying fair taxes (problems combining 
various proportions), and so forth. This compilation 
from the Ist century BCE or CE (specialists disagree 
on the exact date of its completion) has been restored 
based on two main sources. The oldest extant copy, 
which is also the oldest existing mathematics book 
ever printed, dates from 1213. However, only the first 
five chapters survive. The complete book known today 
as The Nine Chapters is the result of an 18th-century 
philological work based on both the former source and 
exhaustive quotations in a 15th-century Chinese ency- 
clopaedia, Yongle dadian, compiled under the Yongle 
emperor (1402-24). In any case, like Euclid’s Ele- 
ments, The Nine Chapters gathered and organized 
many mathematical achievements (including arithme- 
tic, geometric, and algebraic algorithms) from preced- 
ing periods. And like the Elements in the West, The 
Nine Chapters played a preeminent role in the devel- 
opment of mathematics in East Asia. Most mathe- 
maticians referred to it, and most of the subjects that 
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they worked on stemmed from it. Its format, adopted 
by most subsequent authors, consists of problems for 
which a numerical answer and a general procedure for 
solution are given. As with any canonical work, many 
scholars wrote commentaries on The Nine Chapters, 
adding explanations and proofs, rewriting procedures, 
and suggesting new ones. The most important surviv- 
ing commentary, attributed to Liu Hui (3rd century), 
contains the earliest Chinese mathematical proofs in 
the modern sense. 

During the 7th century, certain other books were 
gathered together with The Nine Chapters and a Han 
astronomical treatise, Zhoubi (“The Gnomon of the 
Zhou”), by a group under the leadership of imperial 
mathematician and astronomer Li Chunfeng. This 
collection, known as Shibu suanjing (“Ten Classics 
of Mathematics’), became the manual for officials 
trained in the newly established office of mathemat- 
ics. Although some people continued to be officially 
trained as mathematicians thereafter, no advancement 
in mathematics can be documented until the 11th 
century. At that time (1084) the “Ten Classics” was 
edited and printed, an event that seems to have been 
related to renewed activity in mathematics during the 
11th and 12th centuries. This activity is known only 
through later quotations, but it probably paved the way 
for major achievements in the second half of the 13th 
century. At that time China was divided into North 
and South, and achievements by mathematicians in 
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both regions are known: in the South those of Qin 
Jiushao and Yang Hui, and in the North those of Li 
Ye and Zhu Shijie. Mathematical studies in the North 
and South seem to have developed independently, but 
they attest to a common basis. 

While some major works of the 13th century are 
recorded in the Yongle dadian, mathematical knowledge 
quickly deteriorated, as demonstrated by commentaries on 
these books written by the end of the 15th century that show 
that they were no longer understood. By the 17th century 
few ancient Chinese mathematical works were available. 
Thereafter, as Chinese scholars became aware of European 
achievements, they began to look for such works throughout 
the country and strove to interpret them. The end of the 18th 
century saw a large movement of editing rediscovered texts. 
These critical editions are the main sources today for the 
history of Chinese mathematics. Discoveries of new sources 
are now rare, though in the 20th century a mathematical 
book was found in a grave sealed before the end of the 2nd 
century BCE, pushing back by centuries the earliest known 
source on the subject. It is possible that archaeology will 
bring to light new findings and provoke a revolution com- 
parable to that experienced in the historiography of China 
in general. 


THe Nine CHAPTERS 


The Nine Chapters presupposes mathematical knowledge 
about how to represent numbers and how to perform 
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the four arithmetic operations of addition, subtraction, 
multiplication, and division. In it the numbers are writ- 
ten in Chinese characters, but, for most of the proce- 
dures described, the actual computations are intended 
to be performed on a surface, perhaps on the ground. 
Most probably, as can be inferred from later accounts, 
on this surface, or counting board, the numbers were 
represented by counting rods that were used according 
to a decimal place-value system. Numbers represented 
by counting rods could be moved and modified within 
a computation. However, no written computations were 
recorded until much later. As will be seen, setting up the 
computations with counting rods greatly influenced later 
mathematical developments. 

The Nine Chapters contains a number of mathemat- 
ical achievements, already in a mature form, that were 
presented by most subsequent books without substantial 
changes. The most important achievements are described 
briefly in the rest of this section. 


ARITHMETIC OF FRACTIONS 


Division is a central operation in The Nine Chapters. 
Fractions are defined as a part of the result of a division, 
the remainder of the dividend being taken as the numer- 
ator and the divisor as the denominator. Thus, dividing 
17 by 5, one obtains a quotient of 3 and a remainder 
of 2; this gives rise to the mixed quantity 3 + 2/5. The 
fractional parts are thus always less than one, and their 
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arithmetic is described through the use of division. For 
instance, to get the sum of a set of fractions, one is 
instructed to 


multiply the numerators by the denominators that do 
not correspond to them, add to get the dividend. Mul- 
tiply the denominators all together to get the divisor. 
Perform the division. If there is a remainder, name it 
with the divisor. 


This algorithm corresponds to the modern for- 
mula a/b + c/d = (ad + bc)/bd. The sum of a set of frac- 
tions is itself thus the result of a division, of the form 


Columns were used for 
equations, from right to 
left: 

3x + 21y-—3z=0 

-6x - 2y-—z=62 

2x — 3y + 8Z = 32 


= Negative 


«=e Positive 
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Counting boards and markers, or counting 
rods, were used in China to solve systems of 


linear equations. This is an example from the 
Ist century CE. 
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“integer plus proper fraction.” All the arithmetic oper- 
ations involving fractions are described in a similar way. 


ALGORITHMS FOR AREAS AND VOLUMES 


The Nine Chapters gives formulas for elementary plane and 
solid figures, including the areas of triangles, rectangles, 
trapezoids, circles, and segments of circles and the volumes 
of prisms, cylinders, pyramids, and spheres. All these for- 
mulas are expressed as lists of operations to be performed 
on the data in order to get the result—+e., as algorithms. 
For example, to compute the area of a circle, the following 
algorithm is given: “multiply the diameter by itself, triple 
this, divide by four.” This algorithm amounts to using 3 as 
the value for n. Commentators added improved values for 
m along with some derivations. The commentary ascribed 
to Liu Hui computes two other approximations for 7, one 
slightly low (157/50) and one high (3,927/1,250). The Nine 
Chapters also provides the correct formula for the area of 
the circle—“‘multiplying half the diameter and half the cir- 
cumference, one gets the area”—which Liu Hui proved. 


SOLUTION OF SYSTEMS OF SIMULTANEOUS 
LINEAR EQUATIONS 


The Nine Chapters devotes a chapter to the solution of 
simultaneous linear equations—that 1s, to collections of 
relations between unknowns and data (equations) where 
none of the unknown quantities is raised to a power 
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higher than 1. For example, the first problem in this 
chapter, on the yields from three grades of grain, asks: 


3 bundles of top-grade grain, 2 bundles of medium 
grade, and I bundle of low grade yield 39 units of grain. 
2 bundles of top grade, 3 bundles of medium grade, and 
I bundle of low grade yield 34 units. I bundle of top 
grade, 2 bundles of medium grade, and 3 bundles of low 
grade yield 26 units. How many units does a bundle of 
each grade of grain yield? 


The procedure for solving a system of three equations 
in three unknowns involves arranging the data on the com- 
puting surface in the form of a table. The coefficients of 
the first equation are arranged in the first column and the 
coefficients of the second and third equations in the sec- 
ond and third columns. Consequently, the numbers of the 
first row, comprising the first coefficient in each equation, 
correspond to the first unknown. This is an instance of a 
place-value notation, in which the position of a number 
in a numerical configuration has a mathematical meaning. 


The coefficients of each equation are arranged 
in columns in the top box (Western numerals 


are used for clarity). The coefficients of the first xtay+3z=4 
column are multiplied by 2 and subtracted from represents 2x+3y+z=10 
the second column, which is then multiplied by 3x + 2y+z=32 


-1; the coefficients of the first column are then 
multiplied by 3 and subtracted from the third 
column to produce the first box below. Then in 
the first below, the second column is multiplied 
by 4 and added to the third column, which is 
then divided by 12 to produce the next box. 
Since it can now be seen that z = 1, this 
number can be “back-substituted” to yield y= 7 
and x= 15, as shown in the last box. 
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System of equations in Chinese mathematics 
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The main tool for the solution is the use of column reduc- 
tion (elimination of variables by reducing their coefficients 
to zero) to obtain an equivalent configuration. Next, the 
unknown of the third row is found by division, and hence 
the second and the first unknowns are found as well. This 
algorithm is known in the West as Gauss elimination. 

The algorithm described above relies in an essen- 
tial way on the configuration given to the set of data on 
the counting surface. Because the procedure implies a 
column-to-column subtraction, it gives rise to negative 
numbers. The Nine Chapters describes detailed methods 
for computing with positive and negative coefficients that 
enable problems involving two to seven unknowns to be 
solved. This seems to be the first occurrence of negative 
numbers in the history of mathematics. 


SQUARE AND CuBeE Roots 


In The Nine Chapters, algorithms for finding integral parts 
of square roots or cube roots on the counting surface are 
based on the same idea as the arithmetic ones used today. 
These algorithms are set up on the surface in the same way 
as is a division: at the top, the “quotient”; under it, the 
“dividend”; one row below, the “divisor”; at the bottom, 
auxiliary computations. Moreover, the algorithms are 
written out, sentence by sentence, parallel to each other, 
so that their similarities and differences become clear. 
Commenting on these algorithms, Liu Hui sug- 
gested that one could continue computing the non- 
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integral portion of a root in the same way, setting 10 
as denominator for the first subsequent digit, 100 as 
denominator for the first two digits, and so on; he thus 
gave the root in terms comparable to decimal fractions. 
Moreover, in case the algorithm, which generates dig- 
it-by-digit the root of an integer N, did not stop with 
the digit for the units (N was not a perfect square), The 
Nine Chapters stated that another way of providing 
the result of the square root algorithm should be used: 
the root should be given in the form “side of N,” which 
should be understood to mean “square root of N.” 
Thus, quadratic irrationals (an irrational number 
that is the solution to some quadratic equation of the 
kind x = N) were introduced in ancient China and the 
commentaries attest to their use in computations. 

The procedure for extracting square roots was also 
applied to the solution of quadratic equations (in modern 
notation, equations of the form x + bx = c). The qua- 
dratic equation appears to have been conceived of as an 
arithmetic operation with two terms (6 and c). Moreover, 
the equation was thought to have only one root. The the- 
ory of equations developed in China within that frame- 
work until the 13th century. The solution by radicals that 
Babylonian mathematicians had already explored has not 
been found in the Chinese texts that survive. However, the 
specific approach to equations that developed in China 
occurs from at least the end of the 12th century onward in 
Arabic sources, where it is meshed with approaches from 
other parts of the ancient world. 
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PROBLEMS INVOLVING RIGHT TRIANGLES 


Right-angled triangles also constituted a domain in which 
research continued until the 13th century in China. The 
so-called Pythagorean theorem 1s given, under an algorith- 
mic form, in The Nine Chapters. Algorithms are provided 
to solve various problems on right triangles such as the 
following: “Given the base, and the sum of the height and 
of the hypotenuse, find the height and the hypotenuse.” 
Other algorithms are given for determining the diameter 
of an inscribed circle and the side of an inscribed square. 


THE ComMENTARY oF Liu Hul 


Liu Hur’s 3rd-century commentary on The Nine Chapters 1s 
the most important text dating from before the 13th century 
that contains proofs in the modern sense. His commentary 
on the algorithms for computing the volumes of bodies 
exemplifies the kind of mathematical work that he carried 
out throughout the book for the sake of exegesis. Liu proved 
the algorithms already presented in The Nine Chapters, and 
he also provided and proved new algorithms for the same 
three-dimensional volumes. In addition, he organized these 
algorithms, given one after the other without comment 
in The Nine Chapters, into a system in which proofs for one 
algorithm use only algorithms that had already been estab- 
lished independently. He used a small set of proof tech- 
niques, including dissection (even into an infinite number 
of pieces), decomposition into known pieces and recompo- 
sition, and a simplified version of what became known later 
in the West as Cavalieri’s principle, which states that, if two 
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solids of the same height are such that their corresponding 
sections at any level have the same areas, then they have the 
same volume. More precisely, Liu deduced the volume of 
a solid whose cross sections are circles by circumscribing 
each section with a square. (A finer version of Cavalieri’s 
principle was used by Zu Gengzhi in the 5th century to 
establish the correctness of the algorithm computing the 
volume of a sphere.) 

The great importance of Liu Hui’s commentary 
on The Nine Chapters lies in the fact that he proved the 
correctness of algorithms not only in geometry but also in 
arithmetic and algebra. In the course of proving algorithms 
given in various sections of the work, he compared them 
with one another and demonstrated how the same formal 
operations, which he called the “key steps” of computation, 
are brought into play in different algorithms. For example, 
in comparing the procedures for adding fractions and for 
solving simultaneous linear equations (described above )}—a 


Cavalieri's principle 
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comparison which is carried out while establishing their 
correctness—Liu showed that sets of numbers are involved 
(numerator and denominator for a fraction, the coefficients 
of an equation for systems of equations) which share the 
property that all the numbers of a set can be multiplied by 
the same number without altering the mathematical mean- 
ing of that set. Both algorithms, Liu showed, proceed by 
multiplying the sets of numbers that enter into a problem, 
each by an appropriate factor, in such a way that some 
corresponding numbers of the sets are made equal and the 
other numbers are multiplied to keep intact the meaning of 
the whole sets. In the case of fractions, the denominators 
are made equal, and the numerators are changed appro- 
priately. For linear equations, the procedure is the same as 
if two numbers in the same row but in different columns 
were made equal by an appropriate multiplication, so that 
one of them can be eliminated through a column-to-col- 
umn subtraction; the whole columns are then multiplied by 
the same number so that the equations remain valid. Liu 
proceeded from these analogies to state new algorithms for 
the same problems. 


THE "TEN CLassics” 


For reasons that are still unclear, explications of the 
mathematical knowledge presupposed by The Nine 
Chapters (such as the numeration system and arithmetic 
operations) first appeared in later books that eventually 
were included in the “Ten Classics of Mathematics.” 
Most of the subjects dealt with in the later canonical 
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works of mathematics from ancient China relied on 
algorithms presented in The Nine Chapters, although 
sometimes they used versions of these algorithms that 
had a more limited range of applications. 

Nevertheless, it 1s possible to see an ongoing evo- 
lution of some of these topics, such as root extraction 
and the solution of equations. For example, Sunzi 
suanjing (“Sunzi’s Mathematical Classic”) and Zhang 
Qiujian suanjing (“Zhang Qiujian’s Mathematical Clas- 
sic”), both probably written before the 5th century and 
included in the “Ten Classics,” employed new descrip- 
tions of algorithms for the extraction of square and 
cube roots. The underlying procedures were the same, 
and they were still described in parallel ways, but the 
new descriptions showed more clearly the underlying 
mathematical object that is responsible for their similar- 
ity—namely, the equation. What changed in the descrip- 
tions was that, just as division involved a single divisor, 
square root extraction was shown to have two divisors 
and cube root extraction three divisors. (These divisors 
actually are coefficients of the equations that underlie 
the root extractions.) The divisors were shown to play 
similar roles in the algorithms. Moreover, in setting up 
the algorithms, the divisors were arranged one above 
the other, yielding a place-value notation for the under- 
lying equations: the row in which a number occurred 
was associated with the power of the unknown whose 
coefficient it was. However, at that time equations were 
neither written nor conceptualized in terms of such a 
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place-value notation. Early in the 7th century, Wang 
Xiaotong generalized the cube root extraction method 
to solve some third-degree equations using counting 
rods. It was only much later that the concept and rep- 
resentation of equations begat a full-fledged place- 
value notation. 

The “Ten Classics” also attests to research on top- 
ics that were not mentioned in The Nine Chapters but 
that were to be the subject of some of the highest math- 
ematical achievements of the Song and Yuan dynasties 
(960-1368). For example, “Sunzi’s Mathematical Clas- 
sic” presents this congruence problem: 


Suppose one has an unknown number of objects. If one 
counts them by threes, there remain two of them. If one 
counts them by fives, there remain three of them. If one 
counts them by sevens, there remain two of them. How 
many objects are there? 


The procedure used to solve the problem is difficult 
to understand, because it is described in a very condensed 
manner. But it clearly belongs to the tradition that eventu- 
ally led to a general algorithm for solving such problems. 


THEORY OF Root EXTRACTION AND EQUATIONS 


Research appears to have resumed in the 11th century 
with the reediting of the “Ten Classics” and the produc- 
tion of new commentaries. Within this context new devel- 
opments took place in branches of mathematics that had 
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been explored at least since The Nine Chapters, attesting 
to a continuity of mathematical practice. For example, 
regarding root extraction, in the 11th century Jia Xian is 
said to have given an algorithm for finding a fourth root 
using a method similar to the one now known as the 
Ruffini-Horner method. Jia’s algorithm operated on a 
column of rows set up on the counting surface in such 
a way that it still involved a place-value notation for the 
underlying equations. The intermediate values obtained 
in each row (actually the coefficients of the underlying 
equations) resulted from operations that involved only the 
numbers located in the rows below. Again, the algorithm 
made use of the configuration given to this set of numbers 
in an essential way. In addition, the procedures used to 
compute the successive numbers in any row were all the 
same. The new algorithm highlighted that the rows expe- 
rience the same transformations throughout the proce- 
dure—indicating a continued interest in the homogeneity 
of row operations in the descriptions of square and cube 
root extraction. As a consequence, division, square root 
extraction, and cube root extraction now appeared to be 
particular cases of the same general operation, which also 
covered extraction of nth roots. In fact, only the number 
of rows on which the algorithm operated determined the 
nature of the operation: three rows for a square root, four 
rows for a cube root, and so on. 

More generally, research on the solution of equa- 
tions also resumed and revealed that the same basic 
algorithm could be extended to find a root of any alge- 
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braic equation. The first step documented in this direc- 
tion, by the 11th- or 12th-century scholar Liu Yi, was 
finding roots of quadratic equations that have positive 
or negative coefficients. The coefficients, whatever their 
sign, were entered in the table for root extraction, and 
the square root algorithm was adapted to each situation. 

Later, Qin Jiushao’s Shushu jiuzhang (1247; “Math- 
ematical Treatise in Nine Chapters”) attested to the use 
of an algorithm extending Jia Xian’s procedure to find 
“the” root of any equation. (Most Chinese mathema- 
ticians still clung to the idea that an equation had just 
one proper solution.) By that time, general equations 
of any degree were used and were represented by a full- 
fledged place-value notation. This seems to indicate 
that it was the slow evolution of the algorithms of root 
extraction and their comparison that produced a fully 
developed concept of the equation. Similar methods 
(with a slightly different notation) were well known to Li 
Ye, and his Ceyuan haijing (“Sea Mirror of Circle Mea- 
surements”), written only one year after Qin completed 
his book, takes the search for the root of equations for 
granted. Li lived in North China, while Qin lived in the 
South, and is thought to have worked without know- 
ing Qin’s achievements. It is thus highly probable that 
these methods were well known before the middle of the 
13th century. 

In parallel to Jia Xian’s algorithm described above, 
another method developed for determining an nth root 
or finding the root of an equation of any degree, using 
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the coefficients of what is now called Pascal’s triangle and 
the same place-value representation. 


THE METHOD OF THE CELESTIAL UNKNOWN 


Li Ye’s book also contains a method, unknown to Qin 
Jiushao, that seems to have flourished in North China for 
some decades before Li completed “Sea Mirror of Circle 
Measurements.” This method explains how to use polyno- 
mial arithmetic to find equations to solve a problem. Li’s 
book is the oldest surviving work that explains this method, 
but it was probably not the first to deal with it. In this book 
polynomials are also arranged according to a positional 
notation. Thus, x —3x4+5+4 7x is represented as 


5 


2 


A character is added next to the 5 (replaced by a 
dot on the image) to indicate that it is a constant term. 
The location of the coefficient indicates the power of the 
indeterminate with which it is associated. This indeter- 
minate is called “the celestial unknown.” 

Research continued on these topics for several decades, 
as can be seen from the completion in 1299 of Suanxue 
gimeng (“Introduction to Mathematical Science”) by Zhu 
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Shiyie, which devotes some problems to presenting the 
“procedure of the celestial unknown.” Moreover, it is 
known that some mathematicians used this representation 
for polynomials in two or three unknowns; however, their 
writings are lost. In his second surviving book, Siyuan 
yujian (“Precious Mirror of the Four Elements”), Zhu 
made use of four unknowns. Starting from the centre of 
the counting board, in the two horizontal and the two 
vertical directions, he put in increasing order of their 
powers what came from each of the four unknowns. As 
soon as positive and negative powers of the indeterminates 
or too many mixed terms occurred, however, he had to 
use tricks that were in conflict with the principles of the 


Pascal's triangle 
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place-value notation. In problems where there was more 
than one unknown, he had to use a method of elimination 
of a common unknown between two equations. 


CHINESE REMAINDER THEOREM 


Qin Jiushao’s book also contains algorithms for the gen- 
eral congruence problem, an example of which was given 
in Sunzi’s Sth-century treatise, where its solution was too 
obscure to be understood. This problem amounts to deter- 
mining a number, the remainders of which are known 
when it is divided by given numbers (called moduli). There 
is no extant work between Sunzi’s treatise and Qin’s book 
of 1247 that reveals how this algorithm was elaborated. 
Such problems seem to have been worked out because of 
calendrical computation. Qin introduced his discussion by 
saying that his goal was to clarify several procedures used 
by astronomers who were applying them without under- 
standing them. His solution is known today as the Chinese 
remainder theorem. He dealt with the case when moduli 
are relatively prime, and he then reduced the case when 
they are not by first eliminating common factors. The first 
case 1s easily solved when x can be found that satisfies the 
congruence xa = | (mod b), a and b being two given rela- 
tively prime numbers (suppose a < b). Qin gave an algo- 
rithm for this, using a sequence of quotients in searching 
for the greatest common divisor of a and 6, whichis also the 
sequence of convergents for the continued fraction for b/a. 
Having them, he was then able to compute x. 
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Little is known about what happened to Chinese 
mathematics after Zhu Shijie, but surviving books from the 
following centuries attest to a progressive loss of the great 
achievements of the Song-Yuan period. In the 16th century 
a mathematician’s comments on Li Ye’s “Sea Mirror of Cir- 
cle Measurements” show that the method of the celestial 
unknown was no longer understood. By the 17th century 
it seems to have been completely forgotten. Rods were then 
no longer used as a counting tool, so perhaps Chinese alge- 
braic place-value notations, deprived of the instrument on 
which they were based, could not be understood. 

On the other hand, there was a rapid diffusion of 
the abacus, for which many books were written. One of 
them, the Suanfa tongzong (“Systematic Treatise on Mathe- 
matics”) by Cheng Dawei (1592), had a special significance. 
In addition to its detailed treatment of arithmetic on the 
abacus, it provided a summa of mathematical knowledge 
assembled by the author after 20 years of bibliographic 
research. Re-edited several times through the 19th century, 
the “Systematic Treatise” was the main source—and still 1s 
an important source—available to scholars in China, and 
more generally in East Asia, concerning mathematics as it 
developed in China’s tradition. 

When European missionaries arrived in China at the 
end of the 16th century, they found people interested in 
science (so that the missionaries were accepted in China 
because of their scientific knowledge) but unaware of 
their own past in mathematics. An era of translations 
of Western works then started, the first six books 
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of Euclid’s Elements being translated by the Jesuit Mat- 
teo Ricci and Xu Guangqi in 1607. In parallel to this pro- 
cess of translation, Chinese scholars attempted to find 
ancient books, to understand them, and to synthesize 
the Chinese and Western traditions. In the 18th century, 
with the help of Western algebra, Mei Juecheng deci- 
phered the ancient texts dealing with the method of the 
celestial unknown. This triggered a renewed search for 
ancient Chinese sources and attempts to revive mathe- 
matical research with traditional Chinese methods. 


MATHEMATICS IN JAPAN 


THE INTRODUCTION OF CHINESE Books 


Very little is known about Japanese mathematics before 
the 17th century. Beginning in the 7th century, at first 
only indirectly by way of Korea, there was a flow of Chi- 
nese science to Japan. For example, the “Ten Classics of 
Mathematics” was introduced, along with counting rods, 
probably by the 8th century. Yet no Japanese book deal- 
ing with mathematics survives from before the end of the 
16th century. At that time another phase of importation 
began: the abacus and Cheng Dawet’s “Systematic Treatise 
on Mathematics” became known in Japan, though they 
did not supplant the use of counting rods. Moreover, many 
books were brought from Korea, and perhaps in that way 
two Chinese books, Yang Hui suanfa (1275; “Yang Hut’s 
Methods of Mathematics”) and Zhu Shijie’s “Introduction 
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to Mathematical Science,” arrived in Japan. In those books, 
Japanese scholars could find algorithms for solving systems 
of simultaneous linear equations and for searching for the 
root of an equation according to methods used in China 
in the 13th century; they could also find applications of 
the method of the celestial unknown (although these were 
not immediately understood). In addition, books on calen- 
drical computations, which also contained mathematical 
knowledge, were imported. As a result of such infusions, 
Chinese mathematics greatly influenced the development 
of Japanese mathematics (for example, its algebraic orien- 
tation) and defined the context in which the Japanese tradi- 
tion later opened to European mathematics. 

At the beginning of the Tokugawa period (1603— 
1867), contacts with foreigners were limited to trade with 
Chinese and Dutch ships through the port of Nagasaki. 
Some Chinese books, which by then may have contained 
Western knowledge, as well as Dutch books entered Japan 
secretly, but it is difficult to state how much, or what kind 
of, mathematical knowledge entered through that channel. 


THE ELABORATION OF CHINESE METHODS 


Although not the first mathematical book written in 
Japan, Jingoki (“Inalterable Treatise”), published in 1627 by 
Yoshida Mitsuyoshi, seems to be the first book that played 
an important role in the emerging Japanese tradition. 
Inspired by the Chinese text “Systematic Treatise on Math- 
ematics,’ whose importance is stressed above, it described 
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ABACUS 


The abacus, plural abaci or abacuses, is a calculating device, 
probably of Babylonian origin, that was long important in com- 
merce. It is the ancestor of the modern calculating machine 
and computer. 

The earliest “abacus” likely was a board or slab on which 
a Babylonian spread sand so he could trace letters for gen- 
eral writing purposes. The word abacus is probably derived, 
through its Greek form abakos, from a Semitic word such as 
the Hebrew ibeq (“to wipe the dust’; noun abagq, “dust”). As 
the abacus came to be used solely for counting and comput- 
ing, its form was changed and improved. The sand (“dust”) 
surface is thought to have evolved into the board marked with 
lines and equipped with counters whose positions indicated 
numerical values—i.e., ones, tens, hundreds, and so on. In 
the Roman abacus the board was given grooves to facilitate 
moving the counters in the proper files. Another form, com- 
mon today, has the counters strung on wires. 

The abacus, generally in the form of a large calculating 
board, was in universal use in Europe in the Middle Ages, as 
well as in the Arab world and in Asia. It reached Japan in the 
16th century. The introduction of the Hindu-Arabic notation, with 
its place value and zero, gradually replaced the abacus, though 
it was still widely used in Europe as late as the 17th century and 
survives today in the Middle East, China, and Japan. 


in Japanese the use of the soroban, an improvement of the 
Chinese abacus, and introduced some Chinese knowledge. 
Its many editions contributed to popularizing mathemat- 
ics because most of the works on mathematics in Japan 
were written in Chinese and could not be widely read. In 
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its enlarged edition of 1641, Jingoki introduced the method 
of performing computations with counting rods, which 
by then were no longer used in China. Moreover, inspired 
by his Chinese source, Yoshida added “difficult problems” 
that he left without solutions and recommended be posed 
to mathematicians. This initiated a tradition of challenges, 
reminiscent of those that took place in Europe during the 
Renaissance, that strongly stimulated the development of 
mathematics in Japan. In this context, mathematicians 
in the 1650s, relying on counting-rod computations and 
looking for new methods of solution, began to decipher 
the original methods of Chinese algebra—hinted at in the 
1658 Japanese reprint of “Introduction to Mathematical 
Science’”—which enabled them to advance beyond the clas- 
sics. This contrasts with the situation in China, where the 
original methods could be understood only after the intro- 
duction of Western algebra. 

Various Japanese authors disseminated traditional 
Chinese methods for the solution of problems. Sawaguchi 
Kazuyuki’s Kokon sanpoki (1671; “Ancient and Modern 
Mathematics”) pointed out that “erroneous” problems 
could have more than one solution (in other words, equa- 
tions could have more than one root), but he left unan- 
swered difficult problems involving simultaneous equa- 
tions of the nth degree. Equations for their solution were 
published in 1674 by Seki Takakazu, now considered to 
be the founder of the Japanese tradition of mathematics, 
or wasan. Seki founded what became the most important 
school of mathematics in Japan. (At this time, mathemat- 
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ics was widely practiced in Japan as a leisure activity.) As 
in other schools, disciples had to keep the school methods 
secret, and only the best among them knew most of these 
methods. Only slowly did they publish their secrets, which 
hindered the free circulation of ideas and which makes any 
attribution very difficult. 

Explanations of how to use Seki’s equations to derive 
Sawaguchi’s problems were published in 1685 by one of 
Seki’s disciples, Takebe Katahiro. Seki had designed for 
this purpose a “literal” written algebra using characters, 
thus liberating mathematicians from counting rods. He 
kept for equations the positional notation with respect 
to one unknown, the coefficients being expressed in 
terms of numbers, parameters, or other unknowns. In 
establishing equations among several unknowns for the 
solution of a problem, he had to introduce procedures 
equivalent to computations of determinants in order to 
eliminate unknowns between simultaneous equations. 
Further research elaborated these procedures. 

Seki devised a classification of problems that 
amounted to a classification of equations, which took 
into consideration negative roots and multiple roots, the 
existence of which had been noticed by Sawaguchi; for 
this purpose he adapted the Chinese algorithms from the 
13th century. Seki and his disciples thus improved upon 
Chinese methods in many ways, opening new directions 
for the development of mathematics in Japan—as, for 
example, in their work on infinite series, the subject of 
research by contemporary European scientists as well. 


THE PHILOSOPHY OF 
MATHEMATICS 
SoH SSH 


‘ik: is a branch of philosophy that is concerned 


with two major questions: one concerning the 

meanings of ordinary mathematical sentences 
and the other concerning the issue of whether abstract 
objects exist. The first is a straightforward question of 
interpretation: What is the best way to interpret standard 
mathematical sentences and theories? In other words, 
what is really meant by ordinary mathematical sentences 
such as “3 is prime,” “2 + 2 = 4,” and “There are infinitely 
many prime numbers.” Thus, a central task of the phi- 
losophy of mathematics is to construct a semantic theory 
for the language of mathematics. Semantics is concerned 
with what certain expressions mean (or refer to) in ordi- 
nary discourse. So, for instance, the claim that in English 
the term Mars denotes the Mississippi River is a false 
semantic theory; and the claim that in English Mars 
denotes the fourth planet from the Sun is a true semantic 
theory. Thus, to say that philosophers of mathematics are 
interested in figuring out how to interpret mathematical 
sentences is just to say that they want to provide a seman- 
tic theory for the language of mathematics. 
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Philosophers are interested in this question for 
two main reasons: 1) it is not at all obvious what the 
right answer is, and 2) the various answers seem to have 
deep philosophical implications. More specifically, dif- 
ferent interpretations of mathematics seem to produce 
different metaphysical views about the nature of reality. 
These points can be brought out by looking at the sen- 
tences of arithmetic, which seem to make straightforward 
claims about certain objects. Consider, for instance, the 
sentence “4 is even.” This seems to be a simple sub- 
ject-predicate sentence of the form “S is P’—like, for 
instance, the sentence “The Moon is round.” This latter 
sentence makes a straightforward claim about the Moon, 
and likewise, “4 is even” seems to make a straightforward 
claim about the number 4. This, however, is where phi- 
losophers get puzzled. For it is not clear what the num- 
ber 4 is supposed to be. What kind of thing is a number? 
Some philosophers (antirealists) have responded here 
with disbelief—according to them, there are simply no 
such things as numbers. Others (realists) think that there 
are such things as numbers (as well as other mathematical 
objects). Among the realists, however, there are several 
different views of what kind of thing a number is. Some 
realists think that numbers are mental objects (something 
like ideas in people’s heads). Other realists claim that 
numbers exist outside of people’s heads, as features of 
the physical world. There is, however, a third view of the 
nature of numbers, known as Platonism or mathematical 
Platonism, that has been more popular in the history of 
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philosophy. This is the view that numbers are abstract 
objects, where an abstract object is both nonphysical and 
nonmental. According to Platonists, abstract objects exist 
but not anywhere in the physical world or in people’s 
minds. In fact, they do not exist in space and time at all. 

In what follows, more will be said to clarify exactly 
what Platonists have in mind by an abstract object. How- 
ever, it is important to note that many philosophers sim- 
ply do not believe in abstract objects. They think that to 
believe in abstract objects—objects that are wholly non- 
spatiotemporal, nonphysical, and nonmental—is to believe 
in weird, occult entities. In fact, the question of whether 
abstract objects exist is one of the oldest and most contro- 
versial questions of philosophy. The view that there do exist 
such things goes back to Plato, and serious resistance to the 
view can be traced back at least to Aristotle. This ongo- 
ing controversy has survived for more than 2,000 years. 

The second major question with which the philos- 
ophy of mathematics is concerned is this: “Do abstract 
objects exist?” This question is deeply related to the 
semantic question about how the sentences and theories 
of mathematics should be interpreted. For if Platonism 
is right that the best interpretation of mathematics is that 
sentences such as “41s even” are about abstract objects (and 
it will become clear below that there are some very good 
reasons for endorsing this interpretation), and if (what 
seems pretty obvious) sentences such as “4 is even” are 
true, then it would seem natural to endorse the view that 
abstract objects exist. 
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MATHEMATICAL PLATONISM 
FORMAL DEFINITION 


Mathematical Platonism, formally defined, is the view 
that (a) there exist abstract objects—objects that are 
wholly nonspatiotemporal, nonphysical, and nonmen- 
tal—and (b) there are true mathematical sentences that 
provide true descriptions of such objects. The discussion 
of Platonism that follows will address both (a) and (b). 

It is best to start with what is meant by an abstract 
object. Among contemporary Platonists, the most com- 
mon view is that the really defining trait of an abstract 
object is nonspatiotemporality. That is, abstract objects 
are not located anywhere in the physical universe, and they 
are also entirely nonmental, yet they have always existed 
and they always will exist. This does not preclude having 
mental ideas of abstract objects; according to Platonists, 
one can—e.g., one might have a mental idea of the num- 
ber 4. It does not follow from this, though, that the num- 
ber 4 is just a mental idea. After all, people have ideas of 
the Moon in their heads too, but it does not follow from 
this that the Moon is just an idea, because the Moon and 
people’s ideas of the Moon are distinct things. Thus, when 
Platonists say that the number 4 is an abstract object, they 
mean to say that it is a real and objective thing that, like 
the Moon, exists independently of people and their think- 
ing but, unlike the Moon, is nonphysical. 

Abstract objects are also, according to Platonists, 
unchanging and entirely noncausal. Because abstract 
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objects are not extended in space and not made of phys- 
ical matter, it follows that they cannot enter into cause- 
and-effect relationships with other objects. 

Platonists also claim that mathematical theorems 
provide true descriptions of such objects. What does 
this claim amount to? Consider the positive integers 
(1, 2, 3,...). According to Platonists, the theory of 
arithmetic says what this sequence of abstract objects 
is like. Over the years, mathematicians have discovered 
all sorts of interesting facts about this sequence. For 
instance, Euclid proved more than 2,000 years ago that 
there are infinitely many prime numbers among the posi- 
tive integers. Thus, according to Platonists, the sequence 
of positive integers is an object of study, just like the 
solar system is an object of study for astronomers. 

Now, so far, only one kind of mathematical object 
has been discussed, namely, numbers. But there are many 
different kinds of mathematical objects—functions, sets, 
vectors, circles, and so on—and for Platonists these are all 
abstract objects. Moreover, Platonists also believe that there 
are such things as set-theoretic hierarchies and that set the- 
ory describes these structures. And so on for all the various 
branches of mathematics. In general, according to Pla- 
tonists, mathematics is the study of the nature of various 
mathematical structures, which are abstract in nature. 

Platonism has been around for over two millennia, 
and over the years it has been one of the most popular 
views among philosophers of mathematics. Yet, for most 
of the history of philosophy, mathematical Platonism 
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was stagnant. In the late 19th century Gottlob Frege of 
Germany, who founded modern mathematical logic, 
developed what is widely thought to be the most powerful 
argument in favour of Platonism; but he did not alter the 
formulation of the view. Likewise, in the 20th century Kurt 
Godel of Austria and Willard Van Orman Quine of the 
United States introduced hypotheses in an attempt to 
explain how human beings could acquire knowledge of 
abstract objects—but again, neither of these thinkers 
altered the Platonist view itself. (Godel’s hypothesis was 
about the nature of human beings, and Quine’s hypothesis 
was about the nature of empirical evidence.) 


NONTRADITIONAL VERSIONS 


During the 1980s and ’90s, various Americans developed 
three nontraditional versions of mathematical Platonism: 
one by Penelope Maddy, a second by Mark Balaguer and 
Edward Zalta, and a third by Michael Resnik and Stewart 
Shapiro. All three versions were inspired by concerns over 
how humans could acquire knowledge of abstract objects. 

According to Maddy, mathematics is about abstract 
objects, and abstract objects are, in some important sense, 
nonphysical and nonmental, though they are located in 
space and time. Maddy developed this idea most fully in 
connection with sets. For her, a set of physical objects is 
located right where the physical objects themselves are 
located. For instance, if there are three eggs in a refrigerator, 
then the set containing those eggs is also in the refrigerator. 
This might seem eminently sensible, and one might wonder 
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why Maddy counts as a Platonist at all; that is, one might 
wonder why a set of eggs counts as a nonphysical object 
in Maddy’s view. In order to appreciate why Maddy is a 
Platonist (in some nontraditional sense), it is necessary to 
know something about set theory—most notably, that for 
every physical object, or pile of physical objects, there are 
infinitely many sets. For instance, if there are three eggs in 
a refrigerator, then corresponding to these eggs there exists 
the set containing the eggs, the set containing that set, the 
set containing that set, and so on. Moreover, there is also a 
set containing two different sets—namely, the set contain- 
ing the eggs and the set containing the set containing 
the eggs—and so on without end. Thus, combining the 
principles of set theory (which Maddy wants to pre- 
serve) with Maddy’s thesis that sets are spatiotemporally 
located implies that if there are three eggs in a given 
refrigerator, then there are also infinitely many sets in 
the refrigerator. Of course there is only a finite amount 
of physical stuff in the refrigerator. More specifically, it 
contains a rather small aggregate of egg-stuff. Thus, for 
Maddy the various sets built up out of this egg-stuff are 
all distinct from the aggregate itself. In order to avoid 
contradicting the principles of set theory, Maddy has to 
say that the sets are distinct from the egg-aggregate, and 
so even though she wants to maintain that all these sets 
are located in the refrigerator, she has to say that they 
are nonphysical in some sense. (Again, the reason that 
Maddy altered the Platonist view by giving sets spatio- 
temporal existence is that she thought it was necessary in 
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order to explain how anyone could acquire knowledge of 
abstract objects. 

According to Balaguer and Zalta, on the other hand, 
the only versions of Platonism that are tenable are those 
that maintain not just the existence of abstract objects but 
the existence of as many abstract objects as there can pos- 
sibly be. If this is right, then any system of mathematical 
objects that can consistently be conceived of must actually 
exist. Balaguer called this view “full-blooded Platonism,” 
and he argued that it is only by endorsing this view that 
Platonists can explain how humans could acquire knowl- 
edge of abstract objects. 

Finally, the nontraditional version of Platonism 
developed by Resnik and Shapiro is known as structur- 
alism. The essential ideas here are that the real objects of 
study in mathematics are structures, or patterns—things 
such as infinite series, geometric spaces, and set-theoretic 
hierarchies—and that individual mathematical objects 
(such as the number 4) are not really objects at all in the 
ordinary sense of the term. Rather, they are simply posi- 
tions in structures, or patterns. This idea can be clarified 
by thinking first about nonmathematical patterns. 

Consider a baseball defense, which can be thought 
of as a certain kind of pattern. There is a left fielder, a 
right fielder, a shortstop, a pitcher, and so on. These are 
all positions in the overall pattern, or structure, and they 
are all associated with certain regions on a baseball field. 
Now, when a specific team takes the field, real players 
occupy these positions. For instance, during the early 
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1900s Honus Wagner usually occupied the shortstop posi- 
tion for the Pittsburgh Pirates. He was a specific object, 
with spatiotemporal location. However, one can also 
think about the shortstop position itself. It is not an object 
in the ordinary sense of the term; rather, it is a role that 
can be filled by different people. According to Resnik and 
Shapiro, similar things can be said about mathematical 
structures. They are something like patterns, made up of 
positions that can be filled by objects. The number 4, for 
instance, is just the fourth position in the positive integer 
pattern. Different objects can be put into this position, 
but the number itself is not an object at all; it 1s merely 
a position. Structuralists sometimes express this idea by 
saying that numbers have no internal properties or that 
their only properties are those they have because of the 
relations they bear to other numbers in the structure; e.g., 
4 has the property of being between 3 and 5. This is anal- 
ogous to saying that the shortstop position does not have 
internal properties in the way that actual shortstops do; 
for instance, it does not have a height or a weight or a 
nationality. The only properties that it has are structural, 
such as the property of being located in or near the infield 
between the third baseman and the second baseman. 


MATHEMATICAL ANTI-PLATONISM 


Many philosophers cannot bring themselves to believe 
in abstract objects. However, there are not many tena- 
ble alternatives to mathematical Platonism. One option 
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is to maintain that there do exist such things as num- 
bers and sets (and that mathematical theorems provide 
true descriptions of these things) while denying that 
these things are abstract objects. Views of this kind can 
be called realistic versions of anti-Platonism. Like Pla- 
tonism, they are still versions of mathematical realism 
because they maintain that mathematical theorems pro- 
vide true descriptions of some part of the world. 

In contrast to realistic versions of anti-Platonism, 
there is also an antirealist view known as mathematical 
nominalism. This view rejects the belief in the existence 
of numbers, sets, and so on and also rejects the belief 
that mathematical theorems provide true descriptions of 
some part of the world. 

The two main alternatives to Platonism, then, are 
realistic anti-Platonism and nominalism. These alterna- 
tives are described more fully in the following two sections. 


ReEAListic ANTI-PLATONISM 


There are two different versions of realistic anti-Pla- 
tonism, namely, psychologism and physicalism. Psychol- 
ogism is the view that mathematical theorems are about 
concrete mental objects of some sort. In this view, num- 
bers and circles and so on do exist, but they do not exist 
independently of people; instead, they are concrete men- 
tal objects—in particular, ideas in people’s heads. As will 
become clearer below (in the section Mathematical Pla- 
tonism: for and against), psychologism has serious prob- 
lems and is no longer endorsed by many philosophers; 
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nonetheless, it was popular during the late 19th and early 
20th centuries, the most notable proponents being the 
German philosopher Edmund Husserl and the Dutch 
mathematicians L.E.J. Brouwer and Arend Heyting. 
Physicalism, on the other hand, is the view that math- 
ematics is about concrete physical objects of some sort. 
Advocates of this view agree with Platonists that there exist 
such things as numbers and sets, and, unlike adherents of 
psychologism, they also agree that these things exist inde- 
pendently of people and their thoughts. Physicalists differ 
from Platonists, however, in holding that mathematics 1s 
about ordinary physical objects. There are a few different 
versions of this view. For example, one might hold that geo- 
metric objects, such as circles, are regions of actual physical 
space. Similarly, sets might be claimed to be piles of actual 
physical objects—thus, a set of eggs would be nothing 
more than the aggregate of physical matter that makes up 
the eggs. Moving on to numbers, one strategy is to take 
them to be physical properties of some sort—for example, 
properties of piles of physical objects, so that, for instance, 
the number 3 might be a property of a pile of three eggs. 
It should be noted here that many people have endorsed a 
Platonistic view of properties. In particular, Plato thought 
that, in addition to all the red things he observed in the 
world, there exists an independent property of redness and 
that this property was an abstract object. Aristotle, on the 
other hand, thought that properties exist in the physical 
world; thus, in his view, redness exists in particular objects, 
such as red houses and red apples, rather than as an abstract 
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object outside of space and time. So in order to motivate a 
physicalistic view of mathematics by claiming that numbers 
are properties, one would also have to argue for an Aristo- 
telian, or physicalistic, view of properties. One person who 
has developed a view of this sort since Aristotle is the Aus- 
tralian philosopher David Armstrong. 

Another strategy for interpreting talk of numbers to 
be about the physical world is to interpret it as talk about 
actual piles of physical objects rather than properties of such 
piles. For instance, one might maintain that the sentence 
“2 + 3 = 5S” is not really about specific entities (the 
numbers 2, 3, and 5); rather, it says that when- 
ever a pile of two objects is pushed together with a 
pile of three objects, the result is a pile of five objects. 
A view of this sort was developed by the English philoso- 
pher John Stuart Mill in the 19th century. 


NomMINALISM 


Nominalism is the view that mathematical objects such 
as numbers and sets and circles do not really exist. Nom- 
inalists do admit that there are such things as piles of 
three eggs and ideas of the number 3 in people’s heads, 
but they do not think that any of these things is the num- 
ber 3. Of course, when nominalists deny that the number 
3 is a physical or mental object, they are in agreement 
with Platonists. They admit that if there were any such 
thing as the number 3, then it would be an abstract object; 
but, unlike mathematical Platonists, they do not believe 
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in abstract objects, and so they do not believe in num- 
bers. There are three different versions of mathematical 
nominalism: paraphrase nominalism, fictionalism, and 
what can be called neo-Meinongianism. 

The paraphrase nominalist view can be elucidated by 
returning to the sentence “4 is even.” Paraphrase nominal- 
ists agree with Platonists that if this sentence is interpreted 
at face value—.e., as saying that the object 4 has the prop- 
erty of being even—then it makes a straightforward claim 
about an abstract object. However, paraphrase nominal- 
ists do not think that ordinary mathematical sentences 
such as “4 is even” should be interpreted at face value; 
they think that what these sentences really say is different 
from what they seem to say 
on the surface. More specifi- 
cally, paraphrase nominalists 
think that these sentences do 
not make straightforward 
claims about objects. There 
are several different versions 
of paraphrase nominalism, 
of which the best known is 
“if-thenism,”’ or deductiv- 
ism. According to this view, 
the sentence “4 is even” can 
be paraphrased by the sen- 
tence “If there were such 
things as numbers, then 4 
Edmund Husserl, c. 1930 would be even.” In this view, 
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even if there are no such things as numbers, the sentence 
“4 is even” is still true. For, of course, even if there is no 
such thing as the number 4, it 1s still true that, if there 
were such a thing, it would be even. Deductivism has 
roots in the thought of David Hilbert, a brilliant German 
mathematician from the late 19th and early 20th centu- 
ries, but it was developed more fully by the American 
philosophers Hilary Putnam and Geoffrey Hellman. 
Other versions of paraphrase nominalism have been 
developed by the American philosophers Haskell Curry 
and Charles Chihara. 

Mathematical fictionalists agree with paraphrase 
nominalists that there are no such things as abstract 
objects and, hence, no such things as numbers. They think 
that paraphrase nominalists are mistaken, however, in 
their claims about what mathematical sentences such as 
“41s even” really mean. Fictionalists think that Platonists 
are right that these sentences should be read at face value; 
they think that “4 is even” should be taken as saying just 
what it seems to say—namely, that the number 4 has the 
property of being even. Moreover, fictionalists also agree 
with Platonists that if there really were such a thing as the 
number 4, then it would be an abstract object. But, again, 
fictionalists do not believe that there is such a thing as the 
number 4, and so they maintain that sentences like “4 is 
even” are not literally true. Fictionalists think that sen- 
tences such as “4 is even” are analogous in a certain way 
to sentences like “Santa Claus lives at the North Pole.” 
They are not literally true descriptions of the world, but 
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they are true in a certain well-known story. Thus, accord- 
ing to fictionalism, arithmetic is something like a story, 
and it involves a sort of fiction, or pretense, to the effect 
that there are such things as numbers. Given this pretense, 
the theory says what numbers are like, or what they would 
be like if they existed. Fictionalists then argue that it is 
not a bad thing that mathematical sentences are not liter- 
ally true. Mathematics is not supposed to be literally true, 
say the fictionalists, and they have a long explanation of 
why mathematics is pragmatically useful and intellectually 
interesting despite the fact that it is not literally true. Fic- 
tionalism was first proposed by the American philosopher 
Hartry Field. It was then developed in a somewhat differ- 
ent way by Balaguer, the American philosopher Gideon 
Rosen, and the Canadian philosopher Stephen Yablo. 
The last version of nominalism is neo-Mei- 
nongianism, which derives from Alexius Meinong, 
a late-19th century Austrian philosopher. Meinong 
endorsed a view that was supposed to be distinct from 
Platonism, but most philosophers now agree that it 
is in fact equivalent to Platonism. In particular, Mei- 
nong held that there are such things as abstract objects 
but that these things do not have full-blown existence. 
Philosophers have responded to Meinong’s claims by 
making a pair of related points. First, since Meinong 
thought there are such things as numbers, and since he 
thought that these things are nonspatiotemporal, it fol- 
lows that he was a Platonist. Second, Meinong simply 
used the word exist in a nonstandard way; according 
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to ordinary English, anything that is exists, and so it 1s 
contradictory to say that numbers are but do not exist. 

Advocates of neo-Meinongianism agree with Pla- 
tonists and fictionalists that the sentence “4 is even” should 
be interpreted at face value, as making (or purporting to 
make) a straightforward claim about a certain object— 
namely, the number 4. Moreover, they also agree that if 
there were any such thing as the number 4, then it would 
be an abstract object. Finally, they agree with fictionalists 
that there are no such things as abstract objects. In spite 
of this, neo-Meinongians claim that “4 is even” 1s literally 
true, for they maintain that a sentence of the form “The 
object O has the property P” can be literally true, even 
if there is no such thing as the object O. Thus, neo-Mei- 
nongianism consists in the following (seemingly awkward) 
trio of claims: (1) mathematical sentences should be read 
at face value, as purporting to make claims about math- 
ematical objects such as numbers; (2) there are no such 
things as mathematical objects; and yet (3) mathematical 
sentences are still literally true. Neo-Meinongianism, in the 
form described here, was first introduced by the New Zea- 
land philosopher Richard Sylvan, but related views were 
held much earlier by the German philosophers Rudolf 
Carnap and Carl Gustav Hempel and the British philos- 
opher Sir Alfred Ayer. Views along these lines have been 
endorsed by Graham Priest of England, Jody Azzouni of 
the United States, and Otavio Bueno of Brazil. 

In sum, then, there are essentially five alternatives to 
Platonism. If one does not want to claim that mathemat- 
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ics is about nonphysical, nonmental, nonspatiotemporal 
objects, then one must to claim either (1) that mathe- 
matics is about concrete mental objects in people’s heads 
(psychologism); or (2) that it is about concrete physical 
objects (physicalism); or (3) that, contrary to first appear- 
ances, mathematical sentences do not make claims about 
objects at all (paraphrase nominalism); or (4) that, while 
mathematics does purport to be about abstract objects, 
there are in fact no such things, and so mathematics is 
not literally true (fictionalism); or (5) that mathematical 
sentences purport to be about abstract objects, and there 
are no such things as abstract objects, and yet these sen- 
tences are still literally true (neo-Meinongianism). 


LoGIcIsM, INTUITIONISM, AND FORMALISM 


During the first half of the 20th century, the philoso- 
phy of mathematics was dominated by three views: log- 
icism, intuitionism, and formalism. Given this, it might 
seem odd that none of these views has been mentioned 
yet. The reason is that (with the exception of certain vari- 
eties of formalism) these views are not views of the kind 
discussed above. The views discussed above concern what 
the sentences of mathematics are really saying and what 
they are really about. But logicism and intuitionism are 
not views of this kind at all, and insofar as certain versions 
of formalism are views of this kind, they are versions of 
the views described above. How then should logicism, 
intuitionism, and formalism be characterized? In order to 
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understand these views, it is important to understand the 
intellectual climate in which they were developed. During 
the late 19th and early 20th centuries, mathematicians and 
philosophers of mathematics became preoccupied with 
the idea of securing a firm foundation of mathematics. 
That is, they wanted to show that mathematics, as ordi- 
narily practiced, was reliable or trustworthy or certain. 
It was in connection with this project that logicism, intu- 
itionism, and formalism were developed. 

The desire to secure a foundation for mathemat- 
ics was brought on in large part by the British philoso- 
pher Bertrand Russell’s discovery in 1901 that naive set 
theory contained a contradiction. It had been naively 
thought that for every concept, there exists a set of things 
that fall under that concept; for instance, corresponding 
to the concept “egg” is the set of all the eggs in the world. 
Even concepts such as “mermaid” are associated with a 
set—namely, the empty set. Russell noticed, however, that 
there is no set corresponding to the concept “not a mem- 
ber of itself.” For suppose that there were such a set—.e., 
a set of all the sets that are not members of themselves. 
Call this set S. Is Sa member of itself? If it is, then it is not 
(because all the sets in S are not members of themselves); 
and if Sis not a member of itself, then it is (because all the 
sets not in S are members of themselves). Either way, a 
contradiction follows. Thus, there is no such set as S. 

Logicism is the view that mathematical truths are ulti- 
mately logical truths. This idea was introduced by Frege. 
He endorsed logicism in conjunction with Platonism, but 
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logicism is consistent with various anti-Platonist views as 
well. Logicism was also endorsed at about the same time 
by Russell and his associate, British philosopher Alfred 
North Whitehead. Few people still endorse this view, 
although there is a neologicist school, the main proponents 
of which are the British philosophers Crispin Wright and 
Robert Hale. 

Intuitionism is the view that certain kinds of mathe- 
matical proofs (namely, nonconstructive arguments) are 
unacceptable. More fundamentally, intuitionism 1s best seen 
as a theory about mathematical assertion and denial. Intu- 
itionists embrace the nonstandard view that mathematical 
sentences of the form “The object O has the property P” 
really mean that there is a proof that the object O has the 
property P, and they also embrace the view that mathemati- 
cal sentences of the form “not-P” mean that a contradiction 
can be proven from P. Because intuitionists accept both of 
these views, they reject the traditionally accepted claim that 
for any mathematical sentence P, either P or not-P is true; 
and because of this, they reject nonconstructive proofs. Intu- 
itionism was introduced by L.E.J. Brouwer, and it was devel- 
oped by Brouwer’s student Arend Heyting and somewhat 
later by the British philosopher Michael Dummett. Brou- 
wer and Heyting endorsed intuitionism in conjunction with 
psychologism, but Dummett did not, and the view 1s consis- 
tent with various nonpsychologistic views—e.g., Platonism 
and nominalism. 

There are a few different versions of formalism. Per- 
haps the simplest and most straightforward 1s metamathe- 
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matical formalism, which holds that ordinary mathematical 
sentences that seem to be about things such as numbers are 
really about mathematical sentences and theories. In this 
view, “4 is even” should not be literally taken to mean that 
the number 4 is even but that the sentence “4 1s even” follows 
from arithmetic axioms. Formalism can be held simultane- 
ously with Platonism or various versions of anti-Platonism, 
but it is usually conjoined with nominalism. Metamathe- 
matical formalism was developed by Haskell Curry, who 
endorsed it in conjunction with a sort of nominalism. 


MATHEMATICAL PLATONISM: FoR 
AND AGAINST 


Philosophers have come up with numerous arguments for 
and against Platonism, but one of the arguments for Pla- 
tonism stands out above the rest, and one of the arguments 
against Platonism also stands out as the best. These argu- 
ments have roots in the writings of Plato, but the pro-Pla- 
tonist argument was first clearly formulated by Frege, and 
the locus classicus of the anti-Platonist argument is a 1973 
paper by the American philosopher Paul Benacerraf. 


THE FREGEAN ARGUMENT FOR PLATONISM 


Frege’s argument for mathematical Platonism boils 
down to the assertion that it is the only tenable view of 
mathematics. (The version of the argument presented 
here includes numerous points that Frege himself never 
made; nonetheless, the argument is still Fregean in spirit.) 
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From the Platonist point of view, the weakest 
anti-Platonist views are psychologism, physicalism, and 
paraphrase nominalism. These three views make contro- 
versial claims about how the language of mathematics 
should be interpreted, and Platonists rebut their claims 
by carefully examining what people actually mean when 
they make mathematical utterances. The following brings 
out some of the arguments against these three views. 

Psychologism can be thought of as involving two 
central claims: (1) number-ideas exist inside people’s heads 
and (2) ordinary mathematical sentences and theories 
are best interpreted as being about these ideas. Very few 
people would reject the first of these theses, but there are 
several well-known argu- 
ments against accepting 
the second view. Three 
are presented here. First 
is the argument that psy- 
chologism makes mathe- 
matical truth contingent 
upon psychological truth. 
Thus, if every human 
being died, the sentence 
“2 + 2 = 4” would sud- 
denly become untrue. 
This seems blatantly 
wrong. The second argu- 
ment is that psychologism 
seems incompatible with 


Alfred Whitehead 
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standard arithmetical theory, which insists that infinitely 
many numbers actually exist, because clearly there are only 
a finite number of ideas in human heads. This is not to say 
that humans cannot conceive of an infinite set; the point 
is, rather, that infinitely many actual objects (1.¢., distinct 
number-ideas) cannot reside in human heads. Therefore, 
numbers cannot be ideas in human heads. Third, psycholo- 
gism suggests that the proper methodology for mathemat- 
ics 1s that of empirical psychology. If psychologism were 
true, then the proper way to discover whether, say, there is 
a prime number between 10,000,000 and 10,000,020 would 
be to do an empirical study of humans to ascertain whether 
such a number existed in someone’s head. This, however, 
is obviously not the proper methodology for mathematics; 
the proper methodology involves mathematical proof, not 
empirical psychology. 

Physicalism does not fare much better in the eyes of 
Platonists. The easiest way to bring out the arguments 
against physicalistic interpretations of mathematics is to 
focus on set theory. According to physicalism, sets are just 
piles of physical objects. But, as has been previously shown, 
sets cannot be piles of physical stuff—or at any rate, when 
mathematicians talk about sets, they are not talking about 
physical piles—because it follows from the principles of 
set theory that for every physical pile, there corresponds 
infinitely many sets. A second problem with physicalistic 
views 1s that they seem incapable of accounting for the sheer 
size of the infinities involved in set theory. Standard set 
theory holds not just that there are infinitely large sets but 
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also that there are infinitely many sizes of infinity, that these 
sizes get larger and larger with no end, and that there actu- 
ally exist sets of all of these different sizes of infinity. There 
is simply no plausible way to take this sort of mathematical 
theorizing about the infinite to be about the physical world. 
Finally, a third problem with physicalism in Platonists’ eyes 
is that it also seems to imply that mathematics is an empir- 
ical science, contingent on physical facts and susceptible to 
empirical falsification. This seems to contradict mathemat- 
ical methodology; mathematics is not empirical (at least 
not usually), and most mathematical truths (e.g., “2 + 3 = 
5”) cannot be empirically falsified by discoveries about the 
nature of the physical world. 

Platonists argue against the various versions of 
paraphrase nominalism by pointing out that they are also 
out of step with actual mathematical discourse. These 
views are all committed to implausible hypotheses about 
the intentions of mathematicians and ordinary folk. For 
instance, deductivism is committed to the thesis that when 
people utter sentences such as “4 is even,” what they really 
mean to say is that, if there were numbers, then 4 would 
be even. However, there simply is no evidence for this 
thesis, and, what is more, it seems obviously false. Similar 
remarks can be made about the other versions of para- 
phrase nominalism; all of these views involve the same 
idea that mathematical statements are not used literally. 
There is no evidence, however, that people use mathemati- 
cal sentences nonliterally. It seems that the best interpreta- 
tion of mathematical discourse takes it to be about (or at 
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any rate, to purport to be about) certain kinds of objects. 
Furthermore, as has already been shown, there are good 
reasons to think that the objects in question could not be 
physical or mental objects. Thus, the arguments outlined 
here seem to lead to the Platonistic conclusion that math- 
ematical discourse 1s about abstract objects. 

It does not follow from this that Platonism is true, 
however, because anti-Platonists can concede all these 
arguments and still endorse fictionalism or neo-Mei- 
nongianism. Advocates of the neo-Meinongian view 
accept the eminently plausible Platonistic interpretation 
of mathematical sentences while also denying that there 
are any such things as numbers and functions and sets; 
but then neo-Meinongians want to claim that mathemat- 
ics is true anyway. Platonists argue that this reasoning 
is absurd. For instance, if mermaids do not exist, then 
the sentence “There are some mermaids with red hair” 
cannot be literally true. Likewise, if there are no such 
things as numbers, then the sentence “There are some 
prime numbers larger than 20” cannot be literally true 
either. Perhaps the best thing to say here is that neo-Mel- 
nongianism warps the meaning of the word true. 

The one remaining group of anti-Platonists, the 
fictionalists, agree with Platonists on how to interpret 
mathematical sentences. In fact, the only point on which 
fictionalists disagree with Platonists is the bare question 
of whether there exist any such things as abstract objects 
(and, as a result, the question about whether mathemat- 
ical sentences are literally true). However, since abstract 
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objects must be nonphysical and nonmental if they exist at 
all, itis not obvious how one could ever determine whether 
they exist. This is the beauty of the fictionalists’ view: they 
endorse all of the Platonists’ arguments that mathematics 
is best interpreted as being about abstract objects, and 
then they simply assert that they do not believe in abstract 
objects. It might seem very easy to dispense with fictional- 
ism, because it might seem utterly obvious that sentences 
such as “2 + 2 = 4” are true. On closer inspection, how- 
ever, this is not at all obvious. If the arguments discussed 
above are correct—and Platonists and fictionalists both 


INFINITY 


Infinity is the concept of something that is unlimited, end- 
less, without bound. The common symbol for infinity, °, was 
invented by the English mathematician John Wallis in 1657. 
Three main types of infinity may be distinguished: the math- 
ematical, the physical, and the metaphysical. Mathematical 
infinities occur, for instance, as the number of points on a 
continuous line or as the size of the endless sequence of 
counting numbers: 1, 2, 3,.... Spatial and temporal concepts 
of infinity occur in physics when one asks if there are infinitely 
many stars or if the universe will last forever. In a metaphys- 
ical discussion of God or the Absolute, there are questions 
of whether an ultimate entity must be infinite and whether 
lesser things could be infinite as well. 

The ancient Greeks expressed infinity by the word apeiron, 
which had connotations of being unbounded, indefinite, unde- 
fined, and formless. One of the earliest appearances of infinity 
in mathematics regards the ratio between the diagonal and the 
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side of a square. Pythagoras (c. 580-500 BCE) and his followers 
initially believed that any aspect of the world could be expressed 
by an arrangement involving just the whole numbers (0, 1, 2, 
3,...), but they were surprised to discover that the diagonal and 
the side of a square are incommensurable—that is, their lengths 
cannot both be expressed as whole-number multiples of any 
shared unit (or measuring stick). In modern mathematics this 
discovery is expressed by saying that the ratio is irrational and 
that it is the limit of an endless, nonrepeating decimal series. In 
the case of a square with sides of length 1, the diagonal is v2, 
written as 1.414213562..., where the ellipsis (...) indicates an 
endless sequence of digits with no pattern. 

Both Plato (428/427-348/347 BCE) and Aristotle (384— 
322 BCE) shared the general Greek abhorrence of the notion 
of infinity. Aristotle influenced subsequent thought for more 
than a millennium with his rejection of “actual” infinity (spa- 
tial, temporal, or numerical), which he distinguished from the 
“potential” infinity of being able to count without end. To avoid 
the use of actual infinity, Eudoxus of Cnidus (c. 400-350 BCE) 
and Archimedes (c. 287—212/211 BCE) developed a tech- 
nique, later known as the method of exhaustion, whereby an 
area was calculated by halving the measuring unit at succes- 
sive stages until the remaining area was below some fixed 
value (the remaining region having been “exhausted”). 

The issue of infinitely small numbers led to the discovery 
of calculus in the late 1600s by the English mathematician Isaac 
Newton and the German mathematician Gottfried Wilhelm Leib- 
niz. Newton introduced his own theory of infinitely small num- 
bers, or infinitesimals, to justify the calculation of derivatives, or 
slopes. In order to find the slope (that is, the change in y over 
the change in x) for a line touching a curve at a given point 
(x, y), he found it useful to look at the ratio between dy and ax, 
where dy is an infinitesimal change in y produced by moving 
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an infinitesimal amount dx from x. Infinitesimals were heavily 
criticized, and much of the early history of analysis revolved 
around efforts to find an alternate, rigorous foundation for the 
subject. The use of infinitesimal numbers finally gained a firm 
footing with the development of nonstandard analysis by the 
German-born mathematician Abraham Robinson in the 1960s. 


accept them—then in order for “2 + 2 = 4” to be true, 
abstract objects must exist. But one might very well doubt 
that there really do exist such things; after all, they seem 
more than a bit strange, and what is more, there does not 
seem to be any evidence that they really exist. 

Or maybe some evidence does exist. This, at any rate, 
is what Platonists want to claim. Platonists have offered a 
few different arguments as refutations of fictionalism, but 
only one of them, known as the indispensability argument, 
has gained any real currency. According to the indispens- 
ability argument, well-established mathematical theorems 
must be true because they are inextricably woven into the 
empirical theories that have been developed and accepted 
in the natural sciences, and there are good reasons to think 
that these empirical theories are true. (This argument has 
roots in the work of Frege and has been developed by Quine 
and Putnam.) Fictionalists have offered two responses to 
this argument. Field has argued that mathematics is not 
inextricably woven into the empirical theories that scien- 
tists have developed; if scientists wanted, he has argued, 
they could extract mathematics from their theories. Fur- 
thermore, Balaguer, Rosen, and Yablo have argued that it 
does not matter whether mathematics is indispensable to 
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empirical science because even if it is, and even if mathe- 
matical theorems are not literally true (because there are 
no such things as abstract objects), the empirical theories 
that use these mathematical theorems could still provide 
essentially accurate pictures of the physical world. 


THE EPISTEMOLOGICAL ARGUMENT 
AGAINST PLATONISM 


The epistemological argument is very simple. It is based 
on the idea that, according to Platonism, mathematical 
knowledge is knowledge of abstract objects, but there 
does not seem to be any way for humans to acquire knowIl- 
edge of abstract objects. The argument for the claim that 
humans could not acquire knowledge of abstract objects 
proceeds as follows: 


¢ (1) Humans exist entirely within space-time. 


¢ (2) If there exist any abstract objects, then they exist 
entirely outside of space-time. 


¢ (3) Therefore, it seems that humans could never 
acquire knowledge of abstract objects. 

There are three ways for Platonists to respond to 
this argument. They can reject (1), they can reject (2), 
or they can accept (1) and (2) and explain why the very 
plausible sounding (3) is nonetheless false. 

Platonists who reject (1) maintain that the human mind 
is not entirely physical and that it is capable of somehow 
forging contact with abstract objects and thereby acquiring 
information about such objects. This strategy was pursued 
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by Plato and Godel. According to Plato, people have imma- 
terial souls, and before birth their souls acquire knowledge 
of abstract objects, so that mathematical learning is really 
just a process of recollection. For Gédel, humans acquire 
information about abstract objects by means of a faculty 
of mathematical intuition—in much the same way that 
information about physical objects is acquired through 
sense perception. 

Platonists who reject (2) alter the traditional Platonic 
view and maintain that, although abstract objects are non- 
physical and nonmental, they are still located in space-time; 
hence, according to this view, knowledge of abstract objects 
can be acquired through ordinary sense perceptions. Maddy 
developed this idea in connection with sets. She claimed that 
sets of physical objects are spatiotemporally located and 
that, because of this, people can perceive them—that is, see 
them and taste them and so on. For example, suppose that 
Maddy 1s looking at three eggs. According to her view, she 
can see not only the three eggs but also the set containing 
them. Thus, she knows that this set has three members sim- 
ply by looking at 1t—analogous to the way that she knows 
that one of the eggs 1s white just by looking at it. 

Platonists who accept both (1) and (2) deny that 
humans have some sort of information-gathering contact 
with abstract objects in the way proposed by Plato, Godel, 
and Maddy; but these Platonists still think that humans can 
acquire knowledge of abstract objects. One strategy that 
Platonists have used here is to argue that people acquire 
knowledge of abstract mathematical objects by acquiring 
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evidence for the truth of their empirical scientific theories; 
the idea is that this evidence provides reason to believe 
all of empirical science, and science includes claims about 
mathematical objects. Another approach, developed by 
Resnik and Shapiro, is to claim that humans can acquire 
knowledge of mathematical structures by means of the 
faculty of pattern recognition. They claim that mathe- 
matical structures are nothing more than patterns, and 
humans clearly have the ability to recognize patterns. 
Another strategy, that of full-blooded Platonism, is 
based on the claim that Platonists ought to endorse the 
thesis that all the mathematical objects that possibly could 
exist actually do exist. According to Balaguer, if full- 
blooded Platonism is true, then knowledge of abstract 
objects can be obtained without the aid of any informa- 
tion-transferring contact with such objects. In particular, 
knowledge of abstract objects could be obtained via the 
following two-step method (which corresponds to the 
actual methodology of mathematicians): first, stipulate 
which mathematical structures are to be theorized about 
by formulating some axioms that characterize the struc- 
tures of interest; and second, deduce facts about these 
structures by proving theorems from the given axioms. 
For example, if mathematicians want to study the 
sequence of nonnegative integers, they can begin with axi- 
oms that elaborate its structure. Thus, the axioms might say 
that there is a unique first number (namely, 0), that every 
number has a unique successor, that every nonzero number 
has a unique predecessor, and so on. Then, from these axi- 
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oms, theorems can be proven—for instance, that there are 
infinitely many prime numbers. This 1s, in fact, how math- 
ematicians actually proceed. The point here is that full- 
blooded Platonists can maintain that by proceeding in this 
way, mathematicians acquire knowledge of abstract objects 
without the aid of any information-transferring contact 
with such objects. Put differently, they maintain that what 
mathematicians have discovered 1s that, in the sequence of 
nonnegative integers (by which is just meant the part or 
parts of the mathematical realm that mathematicians have 
in mind when they select the standard axioms of arith- 
metic), there are infinitely many prime numbers. Without 
full-blooded Platonism this cannot be said, because tradi- 
tional Platonists have no answer to the question “How do 
mathematicians know which axiom systems describe the 
mathematical realm?” In contrast, this view entails that 
all internally consistent axiom systems accurately describe 
parts of the mathematical realm. Therefore, full-blooded 
Platonists can say that when mathematicians lay down 
axiom systems, all they are doing is stipulating which parts 
of the mathematical realm they want to talk about. Then 
they can acquire knowledge of those parts simply by prov- 
ing theorems from the given axioms. 


ONGOING IMPASSE 


Just as there is no widespread agreement that fictionalists 
can succeed in responding to the indispensability argu- 
ment, there is no widespread agreement that Platonists can 
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adequately respond to the epistemological argument. It 
seems to this writer, though, that both full-blooded Pla- 
tonism and fictionalism can be successfully defended 
against all of the traditional arguments brought against 
them. Recall that Platonism and fictionalism agree on how 
mathematical sentences should be interpreted—that is, 
both views agree that mathematical sentences should be 
interpreted as being statements about abstract objects. On 
the other hand, Platonism and fictionalism disagree on the 
metaphysical question of whether abstract objects exist, 
and an examination of the foregoing debate does not pro- 
vide any compelling reason to endorse or reject either view 
(though some reasons have proved plausible and attractive 
enough to persuade people to take sides on this question). 
In fact, humanity seems to be cut off in principle from ever 
knowing whether there are such things as abstract objects. 
Indeed, it seems to this writer that it is doubtful that a cor- 
rect answer even exists. For it can be argued that the concept 
of an abstract object is so unclear that there 1s no objective, 
agreed-upon condition that would need to be satisfied in 
order for it to be true that there are abstract objects. This 
view of the debate is extremely controversial, however. 


abacus A calculating device that uses counters that 
represent numerical values. Early abaci were often 
constructed of a board or slab upon which letters or 
numbers could be traced in sand. 


arithmetic Branch of mathematics in which numbers, 
relations among numbers, and observations on num- 
bers are studied and used to solve problems. This 
includes measurement and computation including 
addition, subtraction, multiplication, division, raising 
to powers, and extraction of roots. 

astrology A type of divination wherein events can 


supposedly be foreseen because of the positions of 
the Sun, Moon, and planets. 


astronomy The study of all objects outside of Earth’s 
atmosphere, including the evolution, physics, chemis- 
try, and motion of celestial objects. 


axiom A rule that is generally accepted. 


calculus Branch of mathematics, developed by Isaac 
Newton and Gottfried Wilhelm Leibniz, concerned 
with the calculation of instantaneous rates of change 
and the summation of infinitely many small factors 
to determine some whole. 


chord A line segment joining two points on a curve. 


density Weight per unit volume of a liquid, solid, or gas. 
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ellipsoid A symmetrical object of which all plane cross 
sections are either ellipses or circles, with three mutu- 
ally perpendicular axes that intersect at the centre. 


geometry The branch of mathematics concerned with 
the shape and size of individual objects, position and 
distance among various objects, and the properties of 
surrounding space. 


heliocentric Relating to the theory that Earth and the 
planets revolve around the Sun and that it is station- 
ary and at the centre of the universe. 


hieroglyphic A system of writing that uses characters 
in the form of pictures. 


hyperbola A two-branched open curve produced by 
the intersection of a circular cone and a plane that 
cuts through both halves of the cone. 


papyrus Material derived from the papyrus plant that 
can be pressed into a smooth surface that can be 
written upon. 


parabola Open curve produced by the intersection of 
a circular cone and a plane parallel to an element of 
the cone. 


paradox A statement that seems to contradict itself, 
but has an underlying meaning after some consider- 
ation, making it true. 


postulate Another word for axiom, or a rule that is 
generally accepted. 
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proportion A relation between parts, wherein one part 
is a constant multiple of the other or they share a 
constant ratio. 


Pythagorean theorem Theorem that states that the 
sum of the squares of the legs of a right triangle is 
equal to the square of the hypotenuse. 


ratio A comparison between two or more things com- 
paring quantity, amount, or size. 

reciprocal A multiplicative inverse; for instance the 
reciprocal of 3 would be 1/3. 

sage A wise person or scholar. 

sexagesimal Relating to or based on the number sixty; 
sixty being the smallest number divisible by every 
number from | to 6. 


theorem A statement or proposition that is proven by 
formulas and accepted as truth. 


trigonometry The branch of mathematics concerned 
with the functions of angles and the relationships 
between the sides and angles of triangles. 


velocity Quantity that defines how fast and in what 
direction an object 1s moving. 
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ing Russell’s own view. Stephan Kérner, The Philosophy of 
Mathematics (1960, reissued 1986), is a classic introductory 
overview of the debate between logicists, intuitionists, and 
formalists during the first half of the 20th century. Stewart 
Shapiro, Thinking About Mathematics (2000), is a very good 
recent book that provides a more general introduction to the 
philosophy of mathematics. Mark Balaguer, Platonism and 
Anti-Platonism in Mathematics (1998, reissued 2001), pro- 
vides an overview of various theories in the philosophy of 
mathematics, while arguing for a very specific, original view 
of its own. Finally, two works written by mathematicians 
are G.H. Hardy, A Mathematician s Apology, rev. ed. (1969, 
reissued 1999); and Reuben Hersh, What Is Mathematics, 
Really? (1997). 
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